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COMPOUNDS USEFUL IN THE DIAGNOSIS AND TREATMENT OF MALARIA 
Field of the Invention 

The present invention relates to the fields of preventing or treating malaria and it provides 
compounds, which are useful within these fields. These compounds may constitute parts of 
5 pharmaceutical compositions and vaccines and be used in methods of treatment, as 
medicaments and for the manufacture of compositions and/or these compounds may 
provide basis for a method of generating a vaccine against malaria. Furthermore, the 
invention relates to the use of these compounds as biotechnological tools and in in vitro 
diagnostic methods and kits. 

10 General background 

Malaria constitutes a permanent catastrophe. Annually, the disease kills between 1 and 2 
million Africans and the economic losses due to malaria constitute a hindrance for 
economic development. In areas of stable malaria transmission the disease mainly affects 
children, because adults have acquired immunity. In these areas immunity to severe 

15 malaria and protection against malaria deaths is acquired early in life after a few clinical 
infections, whereas the acquisition of immunity that protect individuals from uncomplicated 
febrile malaria episodes is a much more sluggish process, requiring years of exposure. 
Thus, the epidemiological data indicate that immunity against severe disease is mediated 
through a different mechanism than immunity against uncomplicated disease, and that the 

20 targets on the parasites for these two kinds of malaria immunity are separate. 

Malaria is caused by unicellular parasites living and multiplying asexually in the red blood 
ceils (RBC). In each 48-hour cycle, the parasites invade RBC, multiply within them, and 
eventually burst them, before they go on to invade new RBC. 

25 

Plasmodium falciparum is the most virulent of the four species causing malaria and 
responsible for most malarial deaths. The particular virulence of P. falciparum is due to the 
ability of infected erythrocytes to adhere to a variety of host receptors on the endothelial 
lining such as ICAM-1, VCAM, thrombospondin, ELAM-1, and CD36, and avoid splenic 
30 clearance. Unchecked growth and the accumulation of sequestered parasites in vital 
organs such as the brain are crucial elements in the pathogenesis of severe malaria. 

Sequestration is mediated through parasite-encoded, clonally variant surface antigens 
(VSA) inserted into the membrane of the infected RBC and is thought to be an immune 
35 evasion strategy evolved to avoid splenic clearance. The VSA expressed on infected 
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erythrocytes can be divided into serological types using plasma from individuals living in 
malaria endemic areas as typing reagents. 

By flow cytometry analyses, two main serotypes have been defined, one, VSA SM is mainly 
5 expressed by parasites causing severe and life-threatening P. falciparum malaria, and the 
other, VSA UM is dominantiy expressed during uncomplicated malaria infections in semi- 
immune individuals. 

Naturally acquired immunity to malaria is mediated by plasma IgG, which control the 
10 growth of P. falciparum, VSAs are probably the main target of these antibodies, as 

acquisition of protection from P. falciparum malaria corresponds to a gradual accumulation 
of IgG with a broad range of VSA-specificities. Furthermore, VSA-specific immune 
responses steadily restrict the repertoire of VSA that are compatible with parasite survival, 
and drive VSA expression away from VSA SM towards VSAy^ (Nielsen et al., 2002). VSA 
15 expression therefore is non-random as it depends on the degree of immunity in the 
infected host. 

The best-characterised VSA are encoded by the var genes. This gene family, encompassing 
about 60 members per genome, encodes a highly polymorphic set of variant proteins, 
20 which collectively have been named P. faiciparum erythrocyte membrane protein 1 
(PfEMPl). These proteins are located on the surface of the P. fa/c/parum-infected 
erythrocytes and have been shown to mediate adhesion to a number of host receptors. 

The var genes contain a large variable 5' exon and a more conserved 3' exon separated by 
25 an intron. The 5' exon normally contains one or two Cysteine-rich Inter-Domain Regions 
(CIDR) and from two to seven domains designated DBL after their similarity to the Duffy 
Binding Ligand of P. vivax (Smith et al. 2000). A given parasite expresses only one PfEMPl 
at a time, but in each generation a fraction of the daughter parasites may switch to 
expression of alternative PfEMPl species through an unknown process. Different PfEMPl 
30 molecules have different receptor specificities, and clonal switching between expression of 
the various var gene products in a mutually exclusive manner allows the parasite to modify 
its adhesion properties (Wahlgren et al., 1999). 

Thus, the currently available data indicate that severe malaria is caused by parasites 
35 expressing a subset of PfEMPl molecules and that antibodies directed against these are 
responsible for the protection against severe disease acquired early in life by children in 
endemic areas. It follows that the specific identification of the particular PfEMPl molecules 
expressed by the parasites causing severe malaria is important for malaria vaccine 
development. 
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The obvious strategies followed in many laboratories are either to attempt to identify the 
var gene that is dominantly transcribed or to determine which PfEMPl is expressed by 
parasites isolated from patients suffering from severe malaria. 

5 

Both strategies have been halted because the var genes show extensive intra- and inter- 
genomic variation. 

Attempts to identify VSA S M-type var gene transcription has been foiled because primer bias 
10 and concomitant transcription of several var genes makes it is impossible to quantify and 
compare transcription of these genes, when all the target nucleic acid sequences are not 
known. The proteomic approach has proved difficult, partly due to the variation in the 
repertoire already mentioned, partly because PfEMPl is expressed at low levels on the 
surface of erythrocytes, and it has been difficult to obtain sufficient amounts of protein for 
15 a reliable mass spectrometry approach. 

Frustrated by these efforts several laboratories has tried to identify the most relevant 
PfEMPl by generating parasite lines with known adhesion phenotype using repeated 
rounds of panning and identifying the dominantly transcribed var genes. This approach has 
20 identified PfEMPl molecules with a known receptor affinity, but since parasites causing 
severe malaria have never been shown to possess a common binding phenotype, the 
relevance of these PfEMPl molecules in the pathogenesis of severe malaria is unclear. 

The PfEMPls constitute a large and polymorphic family. The proteins described in this 
25 application serve a unique function for the parasite in providing high growths rates in non- 
immune individuals and renders these parasites a unique serological phenotype. This 
discovery is of great potential importance because it makes it possible to design strategies 
that will specifically aim at reducing malaria deaths by preventing, diagnosing, and treating 
severe malaria. 

30 

The entire P. falciparum genome of one parasite isolate, 3D7, have previously been data- 
mined using bioinformatic tools see Gardner et al. 

This article states that products of the var gene family are central to the pathogenesis of 
35 malaria and to the induction of protective immunity in general. However the document 
does not led the skilled addressee to any conclusion of which specific var genes of 3D7 are 
functional relevant for the development of a vaccine against malaria. 
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Several have submitted various sequence to e.g. the UniProt Database, these submissons 
are merely sequences and does not contain any information on the function of the 
sequences nor any guidence for use of these sequences. 

5 Wahlgren M. et al. is a minireview, which states that the Plasmodium falciparum var gene 
repertoire cannot be static in an area where malaria is endemic and that common groups 
of var genes are not shared between different parasites. It states that if the transmission 
rate is high, it is expected that the var gene repertoire is significantly enlarged by frequent 
fusions of different haploid genomes in the mosquito. The document states that 
10 breakthroughs in the molecular understanding of parasitism may lead to both new drugs 
and new vaccine and that PfEMPl is a promising target candidate as its function, critically 
affecting parasite virulence, is currently being unveiled. 

Wahlgren M. et al. only mention PfEMPl as a potential vaccine candidate and does not 
15 point to any specific PfEMPl as being particular relevant as such. By contrast, the 

document states that the var gene repertoire is likely to be significantly enlarged in areas 
of high malaria transmission. Severe malaria is more frequent in areas of high malaria as 
compared to areas of lower transmission. 

20 Salanti et al., reports an analysis of the gene expression profile of a specific parasite-line 
202-CSA and identification of a var gene 202varl r which is unusually similar to another 
previously described var gene FCR3varCSA. These two var genes and the 3D7 PFE1640w 
belongs to a sub-family of var genes named varl. var genes can be sub-grouped into 3 
major groups A, B, and C. 

25 

This reference only describes the sequencing of varl from different field isolates and thus 
do not lead to the conclusion that specific var genes of 3D7 are functional relevant for the 
development of e.g. a vaccine against severe malaria. 

30 Based on both coding- and non-coding regions var genes have been found to group into 
three major groups (group A, B and C) and two intermediate groups B/A and B/C 
representing transitions between the three major groups. Group A consists often genes 
consistently identified as a distinct group by sequence analysis (Lavstsen et al. 2003). 

35 Although the proposed grouping of var genes in 3D7 is common knowledge to a person 
skilled in the art and the sequence of the entire 3D7 P. falciparum genome is known as 
sequence submissions, information on function of SEQ ID NO.: 2, SEQ ID NO.: 4, and SEQ 
ID NO.: 6 in the pathogenesis of malaria and their relevance for a vaccine against severe 
malaria has not previously been described. 

40 

In the 3D7 isolate, Group A contains 10 var genes of which we have identified three var 
genes, using a novel method, as being functionally relevant for development of a vaccine 
agasint severe malaria. However, the global repertoire of var genes is unknown, but 
estimated to encode more than 10 27 possible variants PfEMPl . 
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Summary of the Invention 

In essence, the inventive concept described herein is based on the observation that three 
genes, PFD1235w/MAL7Pl.l, PF11_0008, and PF13_0003 (referred to as SEQ ID NO.: 1, 
5 SEQ ID NO. : 3, and SEQ ID NO.: 5) are transcriptionally up-regulated in parasites of the 
species Plasmodium falciparum, when these parasites have been selected for increased 
antibody recognition by a novel selection method, described in the present application, 
using plasma from semi-immune children. This selection conway parasites a unique 
phenotype characteristic of parasites causing severe malaria. This phenotype is also 

10 obtained when parasites are selected for adhesion to a special kind of bone marrow 

derived endothelical ceils and also results in the up-reguiation of SEQ ID NO.: 1, SEQ ID 
NO.: 3, and SEQ ID NO.: 5. As the cytoadhesion to human endothelial cells and the 
antibody recognition of infected RBC is intimately linked to severe malaria, products of 
these genes provide for novel approaches to diagnosing and treating malaria 

15 prophylactically and/or therapeutically. 

In the broadest sense, the present invention relates to the polypeptides, VAR4, VAR5, and 
VAR6 (referred to as SEQ ID NO.: 2, SEQ ID NO.: 4, and SEQ ID NO.: 6) encoded by SEQ 
ID NO.: 1, SEQ ID NO.: 3, and SEQ ID NO.: 5, respectively or the codon optimised SEQ ID 
20 NO.: 7 encoding SEQ ID NO.: 2 or parts hereof as well as polypeptides, which with respect 
to their sequence are identical in part to such sequences. In addition, the invention relates 
to the SEQ ID NO.: 1, SEQ ID NO.: 3, and SEQ ID NO.: 5 nucleic acid molecule or the 
codon optimised SEQ ID NO.: 7 or parts hereof as well as nucleic acid molecules, which 
with respect to their sequence are identical in part to such sequences. 

25 

A presently preferred embodiment relates to medical uses of any of the polypeptides 
and/or nucleic acids according to the present invention as well as methods of treatment of 
malaria comprising molecules of the invention. 

30 Other aspects of the invention include pharmaceutical compositions and vaccines based on 
the molecules of the invention. In addition, the invention comprises polypeptides or nucleic 
acid molecules of the invention as medicaments, and the use of these polypeptides and 
nucleic acids for the manufacture of compositions, hereunder-immunogenic compositions 
which are to administered in order to prophylactically or therapeutically reduce the 

35 incidence, prevalence or severity of malaria, especially severe malaria. 
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It is further within the scope of the present invention to provide a method of treatment 
and prevention of malaria, which comprises administering an effective amount of one or 
more of the described molecules of the invention to a subject. 

5 It will appear that the mentioned polypeptides and nucleic acid molecules will also be 
useful as biotechnological tools. Therefore, the invention also relates to in vitro diagnostic 
methods, which comprise contacting a sample with polypeptides or nucleic acid molecules 
having the sequences described above, allowing in vitro reactions to occur and 
subsequently detecting any molecular complexes formed. These may for instance be 
10 complexes of antigens and antibodies. In some aspects of the invention, the polypeptides 
of the invention are parts of diagnostic kits. Alternatively, these kits may comprise 
antibodies, which specifically recognise such polypeptides. Kits may also compromise, but 
are not limited to oligonucleotides as part of diagnostic kits based on techniques such as 
array, PCR, and real-time quantitative PGR. 

15 Detailed Description of the Invention 

The new strategy 

The present inventors have taken a novel approach and based their strategy on the hereby 
disclosed knowledge of the serological phenotype expressed by parasites causing severe 
malaria (VSA SM ) and the availability of all var genes sequences in the parasite line 3D7. 

20 

In culture 3D7 will normally express a typical VSA UM serotype, thus the present inventors 
first developed a method to generate VSA SM expressing 3D7 lines and then used 
quantitative real time PCR to identify the dominant var gene transcripts. 

25 They hereafter made specific probes and antibodies, that allowed them to identify the 
PfEMPl molecules expressed on the surface of the erythrocytes responsible for the 
serological phenotype carried by the selected parasite line. 

This serological phenotype was also obtained when the 3D7 parasite line was selected for 
30 adhesion to bone marrow derived endothelial cells. 

The relevance of the discovered PfEMPl molecules were further substantiated by the fact 
that parasites expressing the molecules have a high growth rate in non-immune 
individuals, and that var genes with a high similarity to the discovered genes can be 
35 identified in other parasites. Their approach is completely different from previous 

strategies of defining variant molecules responsible for binding to different receptors. 
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The strategy is based on both epidemiological studies showing that immunity to severe 
malaria is acquired more rapidly than protection from uncomplicated disease and sub- 
clinical infection findings indicating that VSA SM constitute a restricted and antigenically 
conserved VSA subset, whereas VSA UM are more diverse. 

5 

Together, these observations enabled developing of the disease-ameliorating strategies 
presented in the present application, which can protect against mortality and severe 
morbidity by accelerating acquisition of immunity to VSA SM -expressing parasites and 
thereby forcing VSA expression away from VSA S m. 

10 

As highlighted earlier Plasmodium falciparum erythrocyte membrane protein- 1 (PfEMPl), 
are a highly polymorphic and diverse family of proteins. Every parasite genome carries 
about 60 genes encoding PfEMPl and the repertoire of PfEMPl genes differ from parasite 
genome to parasite genome. Thus, PfEMPl genes show both intra- and inter-genomic 

15 variation, and the global repertoire of PfEMPl proteins is unknown, but estimated to be 
very large, in fact the present inventors have estimated that there are more 10 27 possible 
variants of this protein. The common features shared by the PfEMPl family of genes and 
proteins are the organization of the genes (two exons and an intron), and the presence of 
domain structures that can be classified as Duffy Binding Ligand-like (DBL) or Cysteine- 

20 Rich Interdomain Region (CIDR). 

In addition, the proteins share a relative conserved C-terminal tail consisting of a trans- 
membrane region and a relatively short intracellular domain. However, it must be stressed 
that the genes and the encoded proteins vary considerably between each other; both with 
25 regards to sequence (primary structure) and organization of the domains (Lavstsen et al., 
2003). 

It is also clear that expression of different PfEMPl molecules confer parasite different 
functional (Smith et al., 2000; Robinson et al., 2003) and antigenic characteristics (Salanti 
30 et al., 2003). Within PfEMPl domains classified as belonging to the same group and 
subgroup (i.e. DBLa, DBLp, CIDRy etc) short identity blocks of 2-14 amino acids can be 
identified between hyper variable blocks of varying lengths (of up to several hundred 
amino acids) in which there is no or very little homology between randomly chosen 
PfEMPl. Thus, PfEMPl molecules constitute a very large and diverse family of proteins. 

35 

Inventive concept 

The inventive concept disclosed in the present application is based on the unexpected 
observation that the transcription of 3 specific Plasmodium falciparum var genes, SEQ ID 
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NO.: 1, SEQ ID NO.: 3, and SEQ ID NO.: 5, all members of Group A var genes, is up- 
regulated in parasite lines and clones carrying the VSA SM phenotype. This up-regulation 
followed both after antibody-selection and selection for adhesion of infected RBC to bone 
marrow derived endothelial cells in vitro. 

5 

These observations shows that the proteins encoded by these three specific var genes are 
responsible for inducing the first IgG with specificity to PfEMPl on iRBC and thus the 
dominant PfEMPl expressed during P. falciparum infections in immunologically naive 
individuals. Such proteins are useful as therapeutic and prophylactic agents as well as 

10 biological tools and diagnostic agents for the study, treatment and prevention of malaria, 
since these proteins described serve a unique function for the parasite in providing high 
growths rates in non-immune individuals and renders these parasites a unique serological 
phenotype. This is of great potential importance because it makes it possible to design 
strategies that will specifically aim at reducing malaria deaths by preventing, diagnosing, 

15 and treating malaria. 

Polypeptide molecules of the invention 

Thus, in its broadest aspect, the present invention relates to these 3 isolated polypeptides 
comprising at least one amino acid sequence selected from the group consisting of at least 
one of 

20 

a) SEQ ID NO.: 2, SEQ ID NO.: 4 or SEQ ID NO.: 6, and 

b) a sequence having at least 80% sequence identity to a), and 

c) sub-sequences of a) or b) with a minimum length of 10 amino acids, and 

d) sub-sequences of a) b) comprising at least one B-ceil epitope; 

25 

With the proviso that the Exon 2 of SEQ ID NO. : 2, SEQ ID NO.: 4 and/or SEQ ID NO.: 6 is 
excluded. 

With the proviso that the "Fragment 1" and/or "Fragment 2" and/or "Fragment 3" and/or 
30 "Fragment 4" and /or "Fragment 5" and/or "Fragment 6" and/or "Fragment 7" and/or 
"Fragment 8" and/or "Fragment 9" and/or "Fragment 10" and/or "Fragment 11" and/or 
"Fragment 12" and/or "Fragment 13" and/or "Fragment 14" and/or "Fragment 15" and/or 
"Fragment 16" and/or "Fragment 17" "Fragment 18" and/or "Fragment 19" and/or 
"Fragment 20" and/or "Fragment 21" and/or "Fragment 22" and/or "Fragment 23" and/or 
35 "Fragment 24" and/or "Fragment 25" and/or "Fragment 26" of SEQ ID NO.: 2, SEQ ID 
NO.: 4, and/or SEQ ID NO.: 6 is excluded. 

With the proviso that the "Fragment 27" and/or "Fragment 28" and/or "Fragment 29" 
and/or "Fragment 30" and/or of SEQ ID NO.: 2 and/or SEQ ID NO.: 6 is excluded. 
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With the proviso that the "Fragment 31" and/or "Fragment 32" of SEQ ID NO.: 2 is 
excluded. 

5 With the proviso that the amino acid sequence of SEQ ID NO.: 2, SEQ ID NO.: 4 and/or 
SEQ ID NO.: 6 is excluded. 

For the present and any of the following aspects of the invention it applies that it is an 
object of preferred embodiments of the present invention to provide polypeptides, which 
are subject to antibody recognition by antibodies in sera from young children living in 
areas of high malaria transmission intensity and/or capable of mediating cyto-adhesion of 
intact erythrocyte infected by a parasite to endothelial cells, but not the CD36 receptor. 
The distinction to CD36 adhesion can be evaluated by the examples of the present 
invention, e.g. as described in example 4, 5 and example 22. 

Length of the molecules 

For all the aspects of the invention, it is apparent that the polypeptides of the invention, 
which form the basis of the described embodiments of the invention may be less or equal 
to any length between 6 - 3552 (SEQ ID NO. : 2), 6 - 2992 (SEQ ID NO.: 4) and 6 - 3344 
20 (SEQ ID NO.: 6) amino acids, since these polypeptide all have great immunogenic 
capabilities and thus are especially great as candidates for the development for the 
treatment of malaria, particularly in the development of a vaccine. 

Thus the present invention relates to any sub-sequence originating from any of the 
Z5 polypeptides having an amino acid length such as but not limited to less than or equal to 
6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 
30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 
53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 
76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 
30 99, 100, 110, 120, 130, 140, 150, 160, 170, 180, 190, 200, 225, 250, 275, 300, 350, 

400, 450, 500, 600, 700, 800, 900, 1000, 1250 , 1500, 1750, 2000, 2250, 2500, 2750, or 
2990 amino acids in length. In a further embodiment the length of a sub-sequence of SEQ 
ID NO.: 2 can be less than or equal to 3000, 3100, 3200, 3300, 3400, 3500 or 3550 amino 
acids in length, whereas the length of a sub-sequence of SEQ ID NO.: 6 furthermore can 
35 be less than or equal to 3000, 3100, 3200, 3300 or 3340 amino acids in length. 

In addition to these sequences, fragments or sub-sequences of the polypeptide of the 
invention, larger proteins/polypeptides comprising such sub-sequences as part of their 
sequence, are also embodiments of the present invention, thus with respect to all aspects 
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of the invention it may be preferred that the polypeptides of the invention may have a 
length above the disclosed full length of 3553 (SEQ ID NO.: 2), 2994 (SEQ ID NO. : 4) or 
3346 (SEQ ID NO.: 6) amino acids, if combined with other amino acids for e.g. preparation 
purposes. 

5 

Some characteristic lie within the polypeptide sequence of SEQ ID NO.: 2, SEQ ID NO.: 4, 
and SEQ ID NO. : 6 such as B- and T-cell epitopes. 

T-cell epitopes were defined using the SYFPEITHI server at Centre for Biological Sequence 
10 Analysis BioCentrum-DTU at the Technical University of Denmark ( http://syfpeithi.bmi- 
heidelberq.com/Scripts/MHCServer.dll/ EpPredict.htm ) with default settings. B-cell 
epitopes were defined using the Protean 4.0 software in the DNAstar package with default 
settings. 

15 For SEQ ID NO.: 2 predicted T-cell epitopes are defined as, but not limited to amino acid 
no. 13-45, 93-107, 135-167, 201-236, 247-261, 302-316, 327-348, 394-409, 432-446, 
457-473, 532-546, 573-589, 595-609, 625-639, 678-692, 870-890, 942-956, 986-1000, 
1097-1111, 1130-1144, 1151-1165, 1196-1210, 1473-1492, 1508-1548, 1593-1630, 
1633-1647, 1800-1814, 1830-1844, 1923-1937, 1943-1957, 1997-2025, 2235-2249, 

20 2322-2336, 2378-2404, 2427-2441, 2464-2478, 2507-2521, 2533-2553, 2608- 

2637,26662-2683, 2693-2720, 2794-2808, 2851-2866, 2879-2904, 2965-2979, 3074- 
3088, 3092-3149, 3201-3216, 3236-3250, 3274-3288, 3297-3311, 3322-3336, 3350- 
3364, 3375-3395, 3488-3502, 3530-3544. 

25 For SEQ ID NO.: 2 predicted B-cell epitopes are defined as, but not limited to amino acid 
no. 197-281, 365-470, 526-575, 631-772, 820-891, 905-933, 961-1003, 1024-1066, 
1073-1188, 1206-1248, 1269-1367, 1402-1437, 1535-1731, 1991-2026, 2054-2096, 
2124-2229, 2278-2313, 2404-2460, 2488-2530, 2663-2713, 2832-2965, 3014-3112, 
3154-3210, 3322-3406. 

30 

For SEQ ID NO.: 4 predicted T-cell epitopes are defined as, but not limited to amino acid 
no. 117-131, 154-168, 177-213, 224-238, 258-272, 307-321, 332-346, 419-434, 454- 
468, 512-534, 561-595, 619-641, 645-680, 691-710, 781-795, 983-997, 1043-1057, 
1065-1093, 1143-1185, 1296-1310, 1383-1397, 1453-1467, 1567-1597, 1653-1667, 
35 1693-1707, 1740-1754, 1774-1836, 1855-1873, 1929-1943, 2066-2080, 2108-2122, 
2130-2156, 2174-2188, 2206-2220, 2282-2296, 2309-2323, 2439-2453, 2552-2566, 
2578-2592, 2625-2639, 2644-2673, 2697-2714, 2719-2733, 2744-2774, 2802-2816, 
2918-2943, 2971-2985. 
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For SEQ ID NO.: 4 predicted B-cell epitopes are defined as, but not limited to amino acid 
no. 30-65, 89-113, 125-142, 213-272, 390-432, 491-514, 733-792, 833-851, 874-904, 
1040-1069, 1128-1170, 1187-1211, 1258-1276, 1306-1365, 1394-1406, 1441-1499, 
1542-1571, 1648-1678, 1884-1908, 1949-1979, 2268-2309, 2345.2369, 2392-2416, 
5 2853-2906. 

For SEQ ID NO.: 6 predicted T-cell epitopes are defined as, but not limited to amino acid 
no. 14-28, 43-57, 71-85, 222-248, 251-265, 301-315, 318-332, 359-373, 416-443, 446- 
460, 471-485, 518-532, 579-593, 624-643, 783-797, 949-967, 986-1000, 1016-1030, 

10 1075-1093, 1141-1155, 1265-1279, 1340-1372, 1440-1471, 1505-1515, 1529-1556, 
1634-1648, 1659-1673, 1687-1701, 1712-1726, 1813-1827, 1841-1855, 1875-1995, 
2092-2110, 2137-2151, 2161-2171, 2249-2263, 2319-2333, 2379-2393, 2475-2495, 
2502-2516, 2549-2564, 2591-2605, 2629-2643, 2650-2669, 2674-2688, 2734-2766, 
2916-2930, 2989-3018, 3038-3052, 3115-3129, 3131-3145, 3167-3187, 3248-3262, 

15 3323-3337. 

For SEQ ID NO.: 6 predicted B-cell epitopes are defined as, but not limited to amino acid 
no. 7-27, 383-429, 627-786, 898-990, 1089-1221, 1307-1340, 1373-1525, 1670-1815, 
1901-1934, 2007-2027, 2225-2251, 2363-2409, 2495-2528, 2614-2660, 2706-2911, 
20 3221-3300. 

Some characteristic structures lie within the polypeptide sequence of SEQ ID NO.: 2, SEQ 
ID NO.: 4, and SEQ ID NO.: 6 and therefore also within the nucleotide sequence encoding 
these polypeptide sequences. Such structures comprise, but are not necessarily limited to 
25 DBL2, DBL5, and C2. 

Preferred embodiments of the present invention include specific sub-sequences of the 
polypeptide of the invention having a minimum length of 10 amino acids such as sub- 
sequences that are at least 100 amino acids long. In even more preferred embodiments of 
30 the invention, these sub-sequences can be shown by known molecular biological 
techniques to be involved in the interaction with endothelial receptors and/or to be 
recognised by antibodies using plasma from young semi-immune children. 

It is anticipated that relatively short sequences within the SEQ ID NO.: 2, SEQ ID NO.: 4 
35 and/or SEQ ID NO.: 6 are responsible for mediating adhesion to endothelial receptors 
other than CD36. In particular, it is possible that certain CIDR or DBL domains or parts 
hereof are responsible for the adhesion and/or for antibody recognition. 
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In other preferred embodiments of the invention, the sequences of the polypeptide of the 
invention can be shown to possess one or more antigen epitopes. In particular, such 
epitopes may be B-cell epitopes. Optionally, the sub-sequences may also comprise one or 
more T-cell epitopes alone or in combination with the B-cell epitopes. Finally, also larger 
5 polypeptides comprising the polypeptide of the invention or sub-sequences hereof with 
antigen epitopes and/or sequences involved in interaction with endothelial receptors are 
embodiments of the present invention. 

It is also apparent that the polypeptide sequences of the invention can be present in the 
10 form of fusion proteins. In a further preferred embodiment, this fusion protein will 

comprise polypeptide sequences, which will facilitate the purification or detection of the 
protein. These polypeptide sequences may be but are not limited to tags that will facilitate 
purification and detection using commercially available systems such as the HA- ,-c-myc, 
His or GST tags. 

15 

The polypeptide embodiments of the present invention can therefore exhibit a vast degree 
of sequence identity to the full-length of SEQ ID NO.: 2, SEQ ID NO.: 4 and/or SEQ ID 
NO.: 6. It can for instance be appreciated that a fusion protein carrying within its sequence 
one or more B-cell epitopes and or regions of the polypeptide of the invention that are 
20 involved in adhesion to endothelial receptors will have a relatively low overall degree of 
sequence identity to full-length SEQ ID NO.: 2, SEQ ID NO.: 4 and/or SEQ ID NO.: 6. 

Sequence identity of the polypeptide molecules 

For all the aspects of the invention, it is thus apparent that the polypeptides of the 
25 invention may include sequences, which show anywhere between 40-100% sequence 

identity, such as at least 41%, at least 42%, at least 43%, at least 44%, at least 45%, at 
least 46%, at least 47%, at least 48%, at least 49% at least 50%, at least 55%, at least 
60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 81%, at least 82%, 
at least 83%, at least 84%, at least 85%, at least 86%, at least 87%, at least 88%, at 
30 least 89%, at least 90%at least 91%, at least 92%, at least 93%, at least 94%, at least 
95%, at least 96%, at least 97%, at least 98%, at least 99 % or preferably 100% 
sequence identity to SEQ ID NO.: 2, SEQ ID NO.: 4 and/or SEQ ID NO.: 6 or a fragment or 
sub-sequence thereof. 

35 As understood by the skilled adressee, the sequence identity of the polypeptide molecules 
of the present invention will be higher the smaller the fragments or sub-sequences in order 
to be unique, thus in some embodiments of the present invention, the fragments or sub- 
sequences are of 20-50 amino acid long and has a sequence identity of 80-100%, such as 
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20-50 amino acid long, 20-40 amino acid long, 20-30 amino acid long, 20-25 amino acid 
long, 30-50 amino acid long, 30-40 amino acid long, 30-35 amino acid long, 40-50 amino 
acid long, 40-45 or 45-50 amino acid long all with a sequence identity of 80-100%, such 
as 80% identical, 81% identical, 82% identical, 83% identical, 84% identical, 85% 
5 identical, 86% identical, 87% identical, 88% identical, 89% identical, 90% identical, 91% 
identical, 92% identical, 93% identical, 94% identical, 95% identical, 96% identical, 97% 
identical, 98% identical, 99% identical or even 100% identical. 

In other embodiments of the present invention, the fragments or sub-sequences are of 51- 
10 750 amino acid long and has a sequence identity of 70-100%, such as 51-750 amino acid 
long, 60-700 amino acid long, 70-600 amino acid long, 80-500 amino acid long, 90-400 
amino acid long, 100-250 amino acid long, 300-350 amino acid long, 400-500 amino acid 
long, 200-650 or 450-500 amino acid long all with a sequence identity of 70-100%, such 
as 70% identical, 71% identical, 72% identical, 73% identical, 74% identical, 75% 
15 identical, 80% identical, 85% identical, 88% identical, 89% identical, 90% identical, 91% 
identical, 92% identical, 93% identical, 94% identical, 95% identical, 96% identical, 97% 
identical, 98% identical, 99% identical or even 100% identical. 

In yet other embodiments of the present invention, the fragments or sub-sequences are of 
20 more than 751 amino acid long and has a sequence identity of 60%-100%, such as more 
751 amino acid long, more 800 amino acid long, more than 900 amino acid long, more 
than 1000 amino acid long, more than 1100 amino acid long, more than 1200 amino acid 
long, more than 1300 amino acid long, more than 1400 amino acid long, more than 2000 
amino acids long or 2500 amino acid long all with a sequence identity of 60-100%, such as 
25 60% identical, 61% identical, 62% identical, 63% identical, 64% identical, 65% identical, 
70% identical, 75% identical, 77% identical, 80% identical, 85% identical, 88% identical, 
92% identical, 93% identical, 94% identical, 95% identical, 96% identical, 97% identical, 
98% identical, 99% identical or even 100% identical. 

30 Additionally, variants are also an embodiment of the present invention and are determined 
based on a predetermined number of conservative amino acid substitutions as defined 
herein below. Conservative amino acid substitution as used herein relates to the 
substitution of one amino acid (within a predetermined group of amino acids) for another 
amino acid (within the same group), wherein the amino acids exhibit similar or 

35 substantially similar characteristics. 



40 



Within the meaning of the term "conservative amino acid substitution" as applied herein, 
one amino acid may be substituted for another within the groups of amino acids indicated 
herein below: 
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Amino acids having polar side chains (Asp, Glu, Lys, Arg, His, Asn, Gin, Ser, Thr, Tyr, and 
Cys,) 

Amino acids having non-polar side chains (Gly, Ala, Vai, Leu, lie, Phe, Trp, Pro, and Met) 
Amino acids having aliphatic side chains (Gly, Ala Val, Leu, He) 
5 Amino acids having cyclic side chains (Phe, Tyr, Trp, His, Pro) 
Amino acids having aromatic side chains (Phe, Tyr, Trp) 
Amino acids having acidic side chains (Asp, Glu) 
Amino acids having basic side chains (Lys, Arg, His) 
Amino acids having amide side chains (Asn, Gin) 
10 Amino acids having hydroxy side chains (Ser, Thr) 

Amino acids having sulphor-containing side chains (Cys, Met), 
Neutral, weakly hydrophobic amino acids (Pro, Ala, Gly, Ser, Thr) 
Hydrophilic, acidic amino acids (Gin, Asn, Glu, Asp), and 
Hydrophobic amino acids (Leu, He, Val) 

15 

Preferred conservative amino acids substitution groups are: valine-leucine-isoleucine, 
phenylalanine-tyrosine, lysine-arginine, alanine-valine, and asparagine-giutamine. 

Accordingly, a variant or a fragment thereof according to the invention may comprise, 
20 within the same variant of the sequence or fragments thereof, or among different variants 
of the sequence or fragments thereof, at least one substitution, such as a plurality of 
substitutions introduced independently of one another. 

It is clear from the above outline that the same variant or fragment thereof may comprise 
25 more than one conservative amino acid substitution from more than one group of 
conservative amino acids as defined herein above. 

The present invention further relates to addition or deletion of at least one amino acid in 
relation to any of the amino acid sequences according to the invention and may be an 
30 addition or deletion of from preferably 2 to 50 amino acids, such as from 10 to 20 amino 
acids, for example from 20 to 30 amino acids, such as from 40 to 50 amino acids. 

However, additions or deletions of more than 50 amino acids, such as additions from 50 to 
100 amino acids, addition of 100 to 150 amino acids, addition of 150-250 amino acids, are 
35 also comprised within the present invention. Even additions or deletions of 500-2000 

amino acids are within the scope of the present invention. The deletion and/or the addition 
may - independently of one another - be a deletion and/or an addition within a sequence 
and/or at the end of a sequence. 

40 The polypeptide fragments according to the present invention, including any functional 
equivalents thereof, may in one embodiment comprise less than 250 amino acid residues, 
such as less than 240 amino acid residues, for example less than 225 amino acid residues, 
such as less than 200 amino acid residues, for example less than 180 amino acid residues, 
such as less than 160 amino acid residues, for example less than 150 amino acid residues, 
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such as less than 140 amino acid residues, for example less than 130 amino acid residues, 
such as less than 120 amino acid residues, for example less than 110 amino acid residues, 
such as less than 100 amino acid residues, for example less than 90 amino acid residues, 
such as less than 85 amino acid residues, for example less than 80 amino acid residues, 
5 such as less than 75 amino acid residues, for example less than 70 amino acid residues, 
such as less than 65 amino acid residues, for example less than 60 amino acid residues, 
such as less than 55 amino acid residues, for example less than 50 amino acid residues. 

"Functional equivalency" as used in the present invention is according to one preferred 
10 embodiment established by means of reference to the corresponding functionality of a 
predetermined fragment of the sequence. 

It is understood that the polypeptide fragments of the invention may possess one or more 
types of post-translational modifications when expressed on the cell surface. These 
15 modifications may comprise, but are not limited to, glycosylation, phosphorylation, 
acylation, cross-linking, proteolytic cleavage, linkage to an antibody molecule, a 
membrane molecule, or another ligand. 

It is an object of presently preferred embodiments of the present invention that the 
20 polypeptide comprises the amino acid sequence shown in SEQ ID NO. : 2, SEQ ID NO.: 
4 or SEQ ID NO. : 6. 

It is an object of presently most preferred embodiments of the present invention that 
the polypeptide consists of the amino acid sequence shown in SEQ ID NO.: 2, SEQ ID 
25 NO.: 4 or SEQ ID NO.: 6. 



Nucleic acid molecules 

The embodiments of the present invention thus relate to polypeptides of the PfEMPl class 
or sub-sequences hereof as well as nucleic acid molecules encoding such polypeptides or 
30 sub-sequences, wherein said polypeptides and sub-sequences comprise structures that are 
involved directly or indirectly in the binding to endothelial receptors and/or recognised by 
plasma antibodies from young semi-immune children. 

The PFD1235W/MAL7P1.1, PF11_0008, and PF13_0003 genes presented in SEQ ID NO.: 1, 
35 SEQ ID NO.: 3, SEQ ID NO.: 5 and/or the codon optimised SEQ ID NO.; 7 are members of 
Group A vargenes and, in their widest perspective, the embodiments of the invention thus 
relate to nucleic acid molecules, which are characteristic in that they do not belong to the 
varl, var2 gene subfamily as defined in Salanti et al. 2002 and Salanti et al. 2003 or 
Group B, Group C, and Group B/C, but does not exclude Group B/A var genes as defined in 
40 Lavstsen et al. 2003. 
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Furthermore, nucleic acid molecules, which are complementary to the nucleic acid 
molecules of the invention as described above as well as polypeptides encoded by these 
nucleic acid molecules are within the scope of the invention. 

5 One embodiment of the present invention relates to a nucleic acid molecule comprising at 
least one nucleotide sequence selected from the group consisting of at least one of 

a) SEQ IN NO.: 1, SEQ ID NO.: 3, SEQ ID NO.: 5 and codon optimised SEQ ID.: 7 
or a sequence complementary thereof; and 
10 b) a nucleotide sequence having at least 80% sequence identity to a); and 

c) sub-sequences of a) or b) with a minimum length of 30 nucloetides; and 

d) sub-sequences of a) or b) which comprises at least one sequence encoding a B- 
cell epitope; 

15 As understood by the skilled adressee, the sequence identity of the nucleic acid molecules 
of the present invention will be higher the smaller the fragments or sub-sequences in order 
to be unique, thus in some embodiments of the present invention, the fragments or sub- 
sequences are of 60-150 nucleic acid long and has a sequence identity of 80-100%, such 
as 60-140 nucleic acid long, 60-130 nucleic acid long, 60-120 nucleic acid long, 60-110 

20 nucleic acid long, 75-150 nucleic acid long, 80-140 nucleic acid long, 90-135 nucleic acid 
long, 90-150 nucleic acid long, 120-130 or 125-150 nucleic acid long all with a sequence 
identity of 80-100%, such as 80% identical, 81% identical, 82% identical, 83% identical, 
84% identical, 85% identical, 86% identical, 87% identical, 88% identical, 89% identical, 
90% identical, 91% identical, 92% identical, 93% identical, 94% identical, 95% identical, 

25 96% identical, 97% identical, 98% identical, 99% identical or even 100% identical. 

In other embodiments of the present invention, the fragments or sub-sequences are of 
151-2200 nucleic acid long and has a sequence identity of 70-100%, such as 151-2100 
nucleic acid long, 200-1700 nucleic acid long, 300-1600 nucleic acid long, 400-1500 

30 nucleic acid long, 500-1400 nucleic acid long, 1000-1250 nucleic acid long, 1300-1350 
nucleic acid long, 1400-1500 nucleic acid long, 200-1650 or 450-2000 nucleic acid long all 
with a sequence identity of 70-100%, such as 70% identical, 71% identical, 72% identical, 
73% identical, 74% identical, "75% identical, 80% identical, 85% identical, 88% identical, 
89% identical, 90% identical, 91% identical, 92% identical, 93% identical, 94% identical, 

35 95% identical, 96% identical, 97% identical, 98% identical, 99% identical or even 100% 
identical. 

In yet other embodiments of the present invention, the fragments or sub-sequences are of 
more than 2201 nucleic acid long and has a sequence identity of 60%-100%, such as more 
40 2201 nucleic acid long, more 2300 nucleic acid long, more than 2400 nucleic acid long, 
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more than 3500 nucleic acid long, more than 4000 nucleic acid long, more than 5000 
nucleic acid long, more than 5500 nucleic acid long, more than 6000 nucleic acid long, 
more than 7000 nucleic acids long or 8500 nucleic acid long all with a sequence identity of 
60-100%, such as 60% identical, 61% identical, 62% identical, 63% identical, 64% 
5 identical, 65% identical, 70% identical, 75% identical, 77% identical, 80% identical, 85% 
identical, 88% identical, 92% identical, 93% identical, 94% identical, 95% identical, 96% 
identical, 97% identical, 98% identical, 99% identical or even 100% identical. 

With the proviso that the Exon 2 of SEQ ID NO.: 1, 3 and/or 5 is excluded 

10 

With the proviso that the "Fragment 1" and/or "Fragment 2" and/or "Fragment 3" and/or 
"Fragment 4" and /or "Fragment 5" and/or "Fragment 6" and/or "Fragment 7" and/or 
"Fragment 8" and/or "Fragment 9" and/or "Fragment 10" and/or "Fragment 11" and/or 
"Fragment 12" and/or "Fragment 13" and/or "Fragment 14" and/or "Fragment 15" and/or 
15 "Fragment 16" and/or "Fragment 17" "Fragment 18" and/or "Fragment 19" and/or 

"Fragment 20" and/or "Fragment 21" and/or "Fragment 22" and/or "Fragment 23" and/or 
"Fragment 24" and/or "Fragment 25" and/or "Fragment 26" of SEQ ID NO.: 1, SEQ ID 
NO.: 3, SEQ ID NO. : 5 and/or SEQ ID NO.: 7 is exluded. 

20 With the proviso that the "Fragment 27" and/or "Fragment 28" and/or "Fragment 29" 
and/or "Fragment 30" and/or of SEQ ID NO. : 1, SEQ ID NO. : 5 and/or SEQ ID NO. : 7 is 
excluded. 

With the proviso that the "Fragment 31" and/or "Fragment 32" of SEQ ID NO. : 1 and/or 
25 SEQ ID NO.: 7 is excluded. 

With the proviso that the nucleotide sequence of SEQ ID NO. : 1, SEQ ID NO.: 3 SEQ ID 
NO.: 5 and/or codon optimised SEQ ID.: 7 is excluded. 

30 The cDNA sequence encoding SEQ ID NO. 2, SEQ ID NO.: 4, and /or SEQ ID NO.: 6 in the 
parasite line 3D7 is provided in the sequence listing as SEQ ID NO.: 1, SEQ ID NO.: 3 
and/or SEQ ID NO.: 5. 

Again, it is apparent for all the aspects of the invention that the nucleic acid molecules of 
35 the invention may be less than or equal to any length between 30-10662 (SEQ ID NO.: 1), 
30-8985 (SEQ ID NO.: 3) 30-10041 (SEQ ID NO.: 5) or 30-9473 (SEQ ID NO.: 7) 
nucleotides, such as less than or equal to 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 
42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 
65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 
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88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100 110, 120, 130, 140, 150, 160, 170, 
180, 190, 200, 225, 250, 275, 300, 350, 400, 450, 500, 600, 700, 800, 900, 1000, 1250, 
1500, 1750, 2000, 2500, 30O0, 3500, 4000, 4500, 5000, 6000, 6400, 6500, 6600, 6700, 
6800, 6900, 7000, 7100, 72O0, 7300, 7400, 7455, 7456, 7457, 7459, 7460, 7461, 7462, 
5 7463, 7464, 7465, 7466, 7467, 7468, 7469, 7470, 7471, 7472, 7473 nucleotides in length 
for SEQ ID NO.: 7 and further 8000, 8100, 8200, 8300, 8400, 8500, 8600, 8700, 8800, 
8900, 8955, 8956, 8957, 8958, 8959, 8960, 8961, 8962, 8963, 8964, 8965, 8966, 8967, 
8968, 8969, 8970, 8971, 8972, 8973, 8974, 8975, 8976, 8977, 8978, 8979, 8980, 8981, 
8982, 8983, 8984, 8985 nucleotides in length for SEQ ID NO. : 3 and further 9000, 9100, 
10 9200, 9300, 9400, 9500, 9600, 9700, 9800, 9900, 10000, 10011, 10021, 10031, 10041 
nucleotides in length for SEQ ID NO.: 5, and furthermore 10050, 10100, 10200, 10300, 
10400, 10500, 10600, 10650, 10651, 10652, 10653, 10654, 10655, 10656, 10657, 
10658, 10659, or 10660 nucleotides in length for SEQ ID NO.: 1. 

15 Still with respect to all aspects of the invention it may be preferred that the nucleic acid 
molecules of the invention may have a length of 30 - 40, 30 - 50, 30 - 60, 30 - 70, 30 - 
80, 30 - 90, 30 - 100, 30 - 200, 30 - 400, 30 - 500, 30 - 1000, 30 - 1200, 30 - 1300, 30 - 
1400, 30 - 1500, 30 - 1600, 30 - 1700, 30 - 1800, 30 - 1900, 30 - 2000, 30 - 2250, 30 - 
2500, 30 - 2750, 30 - 3000, 30 - 3500, 30 - 4000, 30 - 4500, 30 - 5000, 30 - 6000, 30 - 

20 7000, 30 - 8000, 30 - 9000 nucleotides. 

In preferred embodiments of the invention, sub-sequences of the nucleic acid molecules of 
the invention have a minimum length of 30 nucleotides and in even more preferred 
embodiments these sub-sequences are at least 300 nucleotides long. 

25 

Sequence identity of the nucleic acid molecules 

Preferred nucleic acid embodiments further include nucleic acids encoding fragments of the 
polypeptide of the invention that are involved in interaction with endothelial receptors. In 
addition, it is an object of preferred embodiments that sub-sequences of the nucleic acid 
30 molecule of the invention comprise nucleotides encoding one or more B-cell epitopes 
and/or one or more T-cell epitopes. 

Some characteristic structures lie within the peptide sequence of SEQ ID NO.: 2, SEQ ID 
NO.: 4 and/or SEQ ID NO.: 6 and therefore also within the nucleotide sequence encoding 
35 this peptide sequence. Such structures comprise, but are not necessarily limited to, a 
DBL2p domain followed by a C2 domain. On the other hand, some common features have 
been identified for proteins encoded by Group B, Group C, and Group B/C var genes 
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including, but not limited to the DBL25 domain. These features are not found within the 
amino acid sequence of SEQ ID NO.: 2, SEQ ID NO. : 4 and/or SEQ ID NO.: 6. 

Further embodiments comprise nucleic acid molecules that complement full-length of SEQ 
5 ID NO.: 1, SEQ ID NO.: 3, SEQ ID NO.: 5 and/or SEQ ID.: 7 or sequences identical in part 
hereto as well as nucleotide sequences that complement fragments of full-length SEQ ID 
NO.: 1, SEQ ID NO.: 3, SEQ ID NO.: 5 and/or SEQ ID.: 7, or sequences identical in part 
hereto, 

10 Preferred complementary nucleic acid molecules of the invention comprise nucleic acid 

molecules that are complementary to fragments of SEQ ID NO.: 1, SEQ ID NO.: 3, SEQ ID 
NO. : 5 and/or SEQ ID.: 1, which have a nucleotide sequence that encodes a polypeptide or 
parts of a polypeptide that are involved in interaction with endothelial receptors. 
Additionally, preferred complementary nucleic acid molecules of the invention are 

15 complementary to sequences encoding one or more B-cell epitopes and/or one or more T- 
cell epitopes. 

As discussed for the polypeptide-based compounds of the invention it is also apparent that 
the nucleotide based embodiments may represent only part of the full-length sequence. In 

20 addition these nucleotide sequences may be present in combination with exogenous 

sequences. For all the aspects of the invention, it is thus apparent that the nucleic acids 
molecules of the invention may include sequences that have anywhere between 1-100% 
sequence identity to the full-length sequence of SEQ ID NO.: 1, SEQ ID NO.: 3 SEQ ID 
NO.: 5 and/or SEQ ID NO.: 7, such as at least 5%, at least 10%, at least 15%, at least 

25 20%, at least 25%, at least 30%, at least 35%, at least 40%, at least 45%, at least 50%, 
at least 55%, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at 
least 85%, at least 90%, at least 95%, at least 97% or, preferably, 100% sequence 
identity to SEQ ID NO.: 1, SEQ ID NO.: 3 SEQ ID NO.: 5 and/or 7, or a fragment or sub- 
sequence thereof. 

30 

It is to be understood that the nucleotide sequence of SEQ ID NO.: 1, SEQ ID NO.: 3 
and/or SEQ ID NO.: 5, when present within the genome of the intact Plasmodium 
falciparum parasites as well as the polypeptide sequence of SEQ ID NO.: 2, SEQ ID NO.: 4 
and/or SEQ ID NO.: 6, when present in or on the surface of intact red blood cells infected 
35 with P. falciparum are excluded from the scope of the present invention. This applies to all 
embodiments of the invention described in the present application. Compounds of the 
invention may however comprise sub-sequences of SEQ ID NO.: 1, SEQ ID NO.: 3 SEQ ID 
NO.: 5 and/or SEQ ID NO.: 7 and sub-sequences of SEQ ID NO.: 2, SEQ ID NO.: 4 and/or 
SEQ ID NO.: 6 isolated and/or purified from the Plasmodium parasites or infected RBC. 
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Vectors 

In addition, recombinant polypeptides comprising sub-sequences of the amino acid 
sequence of SEQ ID NO.: 2, SEQ ID NO.: 4 and/or SEQ ID NO.: 6 may be generated by 
use of the above-mentioned nucleic acid embodiments. These can be cioned into vectors 
5 by the use of cloning techniques known in the art. The sequence encoding the polypeptide 
of interest is thereby linked to a heterologous promoter sequence. It may be preferred to 
optimise the codon context and codon pairing for the particular expression system. With 
respect to the polypeptide embodiments of the invention the incorporation of a secretory 
leader sequence may also be of use. The vector can be an expression vector in any of the 
10 mammalian, yeast, amphibian, insect, parasite, or bacterial expression systems known in 
the art. It is therefore apparent that, with the exception of Plasmodium infected RBC, 
prokaryotic and eukaryotic cells hereunder mammalian cells and transformed cell tines as 
well as cells in animals possessing nucleotide and/or amino acid embodiments described 
herein, are within the scope of the present invention. 

15 

Cells and cell lines 

Propagation of such cells or ceil lines may be performed with the intention of providing 
recombinant forms of one or more of the nucleic acid or polypeptide embodiments of the 
invention in amounts that are sufficient for further processing or purification. It is therefore 

20 within the scope of the present invention to provide preparations of compounds, which 
comprise polypeptides of the invention as well as nucleic acid molecules encoding these 
polypeptides. Preparations of such compounds may have a desired degree of purity 
referring to the relative amounts of the desired polypeptide and for instance whole cell 
proteins and unwanted variants of the desired polypeptide as defined above. The existence 

25 of a wide range of protein purification and concentration techniques is known to the skilled 
artisan. These techniques include gel electrophoresis, ion-exchange chromatography, 
affinity and immunoaffinity chromatography, ceramic hydroxyapatite chromatography, 
differential precipitation, molecular sieve chromatography, isoelectric focusing, gel 
filtration, and diafiitration. 

30 

For the various types of chromatography, the desired molecules are suspended in a buffer, 
which promotes adhesion of the molecules to the active surface of the resin and are then 
applied to the chromatography column. Removal of contaminants is performed by washing 
the resin in a buffer of intermediate ionic strength or pH. Elution of the desired molecules 
35 is performed by changing the ionic strength or pH of the buffer to values that will promote 
the dissociation of the molecules from the active surface of the resin used. In the case of 
immunoaffinity chromatography, the polypeptide may be purified by passage through a 
column containing a resin to which is bound antibodies which are specific for at least a 
portion of the polypeptide. Furthermore, His- or GST tags may be added to the 
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polypeptides of the invention. Subsequently, the resulting fusion proteins can be purified 
by affinity chromatography on for instance glutathione sepharose 4B and HIS tag Metal 
Chelate Affinity Chromatography. 

5 It is readily apparent that a person skilled in the art can create nucleic acid molecules of 
virtually any length by ligating a nucleic acid molecule encoding any of the amino acid 
sequences of the present invention or any part thereof to an exogenous nucleotide 
sequence. Recombinant nucleic acid molecules generated by this approach are 
embodiments of the invention. A recombinant construct can be capable of replicating 
10 autonomously within a host cell or, alternatively, it can become integrated into the 

chromosomal DNA. Such a recombinant nucleic acid molecule can comprise a sequence of 
genomic DNA, cDNA, synthetic or semi-synthetic origin. Again , it is preferred that such 
nucleic acid molecules are encoding one or more B-cell epitopes and/or one or more T-ceil 
epitopes. 

15 

The nucleic acid embodiments of the present invention can be altered by genetic 
engineering so as to introduce substitutions, deletions and/or additions. In preferred 
embodiments of the invention, these alterations will provide for sequences encoding 
functionally equivalent molecules or molecules with the same or improved properties. Such 
20 changes of the polypeptide embodiments can be generated using techniques that are 
known to a person skilled in the art, including random mutagenesis and site-directed 
mutagenesis. 

The use of recombinant polypeptides of the invention may be preferred when it is required 
25 that the preparations of these polypeptides are essentially free of any other antigen with 
which they are natively associated, Le. free of any other antigen from Plasmodium 
parasites. As an alternative this may also be accomplished by synthesizing the polypeptide 
fragments by the well-known methods of solid or liquid phase peptide synthesis. 



30 Pharmaceutical compositions 

Additional aspects of the present invention relate to pharmaceutical compositions based on 
any of the polypeptide embodiments of the invention. In a presently preferred embodiment 
these pharmaceutical compositions relates to compositions capable of eliciting an immune 
response, which may or may not be in form of a vaccine. Preferably, such a composition 

35 comprises at least one amino acid sequence according to the present invention. 



It is an object of presently preferred embodiments of the present invention to provide such 
an amino acid sequence selected from the group consisting of at least one' of 
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a) SEQ ID NO.: 2, SEQ ID NO.: 4 and SEQ ID NO.: 6 and 

b) a sequence having at least 80% sequence identity to a); and 

c) sub-sequences of a) or b) with a minimum length of 30 amino acids; and 
5 d) sub-sequences of a) or b) comprising at least one B-cell epitope. 

Alternatively, the pharmaceutical composition according to the present invention may be 
based on any of the nucleotide embodiments of the invention. In a preferred embodiment, 
the pharmaceutical composition comprises at least one nucleotide sequence selected from 
10 the group consisting of at least one of 

a) SEQ ID NO.: 1, SEQ ID NO.: 3, SEQ ID NO.: 5 and SEQ ID.: 7 or a 
sequence complementary thereof; and 

b) a nucleotide sequence having at least 80% sequence identity to a); and 
15 c) sub-sequences of a) or b) with a minimum length of 30 nucleotides; and 

d) sub-sequences of a) and b) which comprise at least one sequence 
encoding a B-cell epitope. 

Such compositions may or may not further comprise a vector containing said amino acid or 
20 nucleotide sequence. 

In a specially preferred embodiment, any of the pharmaceutical compositions described in 
the present application may further comprise a pharmaceutically acceptable carrier and/or 
an adjuvant. 

25 

Peptides, proteins and carbohydrates antigens are usually poorly immunogenic, or not 
immunogenic at all, when administered on their own. Better delivery systems and 
adjuvants may be needed to improve the ease of delivery and immunogenicity. In 
particular, in vivo immunisation with peptides and proteins to generate MHC class I specific 
30 response has been difficult. One possible technique to solve this is the use of ISCOMs, 

which are lipid carriers that act as adjuvants but have minimal toxicity, and they appear to 
load peptide and proteins into the cell cytopiasma, allowing MHC class I restricted T-celi 
response to these peptides, which make them useful in human immunisation. 

35 Pharmaceutical compositions comprising the nucleotide and polypeptide embodiments of 
the invention can be produced by conventional techniques so that the said sequences are 
present as monomeric, multimeric or multimerised agents. Furthermore, antibodies 
generated from the polypeptide embodiments of the invention may constitute part of such 
pharmaceutical compositions. In addition to the active ingredients, pharmaceutical 
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compositions may further comprise one or more physiologically acceptable carriers, 
proteins, supports, adjuvants as well as components that may facilitate the delivery of the 
active components of the compositions. As described above, a large number of adjuvants 
are available including but not limited to Freund's adjuvant, mineral gels such as 
5 aluminium hydroxide, and surface-active substances such as lysolecithin, pluronic polyols, 
polyanions, peptides, oil emulsions, keyhole limpet hemocyanin, and dinitrophenol. As a 
pharmaceutical composition, the nucleic acid and peptide embodiments of the invention 
will be purified and processed through one or more formulation steps. A large variety of 
formulation buffers will be physiologically acceptable, such as phosphate, citrate, and other 
10 organic acids. 

It is further understood that a pharmaceutical composition must be clinically safe. More 
specifically, it must be free of virus and bacteria that can cause infection upon 
administration of the composition to a subject. It may therefore be necessary to process 
15 the composition through on or more steps of virus filtration and/or inactivation. The 
removal of virus by filtration can be obtained by passing the composition through a 
nanofilter, whereas virus inactivation can be accomplished by the addition of various 
detergents and/or solvents or other antiviral compounds to the composition. 

20 Antibodies 

The polypeptide and/or nucleic acid embodiments of the invention may be used in their 
purified form to generate various types of antibodies, and it is understood that such 
antibodies will also be considered as compounds of the invention. These antibodies may 
include, but are not limited to polyclonal, monoclonal, chimeric, single chain, Fab 

25 fragments and fragments produced by a Fab expression library. A person skilled in the art 
knows that antibodies can be produced by e.g. immunisation of various hosts including 
goats, rabbits, rats, and mice. The term 'immunisation' refers to the injection of a 
polypeptide with immunogenic properties. Depending on the host species various types of 
adjuvants can be used in order to increase the immunological response including but not 

30 limited to Freund's adjuvant, mineral gels such as aluminium hydroxide, and surface-active 
substances such as lysolecithin, pluronic polyols, polyanions, peptides, oil emulsions, 
keyhole limpet hemocyanin, and dinitrophenol. 

It is preferred to use shorter sequences of the polypeptide of the invention fused to a 
35 powerful immunogenic molecule such as keyhole limpet hemocyanin resulting in the 

production of antibodies against this chimeric molecule. Accordingly, antibodies capable of 
recognising SEQ ID NO. 2, SEQ ID NO.: 4 and/or SEQ ID NO.: 6 can be produced by 
injection of synthetic peptides consisting of as little as 3 to 30 amino. Thus, it is an object 
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of preferred embodiments of the present invention to provide such smail synthetic peptides 
of3-250 amino acids, such as but not limited to 15-150 amino acids, 20-140 amino acids, 
30-130 amino acids, 40-120 amino acids, 50-110 amino acids, 60-100 amino acids, 70-90 
amino acids, 80-85 amino acids corresponding to a particular sequence of the SEQ ID NO. 
5 2, SEQ ID NO.: 4 and/or SEQ ID NO.: 6 polypeptides. As an alternative, a more diverse set 
of antibodies can be generated by injection of a purified polypeptide embodiment of the 
invention. 

Monoclonal antibodies directed against any of the polypeptides of the present invention, 
10 such as a purified polypeptide embodiment of the invention, can be produced using any of 
the conventional techniques that provide for the production of antibodies from cell lines in 
continuous culture. These techniques include the hybridoma technique, the human B-cell 
hybridoma technique, and the EBV-hybridoma technique. 

It will be readily appreciated that polypeptides of the invention can be incorporated into 
vaccines capable of inducing protective immunity against a specific subtype of malaria. In 
relation to the present invention it is preferred that the vaccine is directed specifically 
against the infectious activity of Plasmodium falciparum in the brain or adhesion to 
endothelial cells, which is characteristic of severe malaria. 

One important aspect of the present invention therefore relates to a vaccine comprising 
one or more B-cell epitopes from a polypeptide encoded by any of the nucleotides 
according to the present invention. This vaccine is characterised in that it induces an 
antibody response wherein said antibody specifically recognises a molecule expressed on 
the surface of an intact erythrocyte infected by VSA SM type parasites or parasites that have 
been selected for their ability to mediate adhesion to endothelial receptors. Generally, this 
molecule is recognised by the antibodies from semi-immune children or children suffering 
from severe malaria. 

30 In preferred embodiments, antibodies directed against the polypeptides and/or nucleic 
acids of the invention can be administered to a subject in order to provide protection 
against the retention and sequestration of iRBC to endothelial cells which is characteristic 
of severe malaria. 

35 Effective amounts of an agent that will promote an immune response against a compound 
of the present invention can be administered to subjects living in endemic areas so as to 
prevent the contraction of malaria. 
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In another embodiment, a subject believed to be at risk for contracting malaria may be 
identified either by conventional methods or by one of the in vitro diagnostic techniques, 
which constitute other embodiments of the present invention. An effective amount of an 
agent that inhibits SEQ ID NO.: 2, SEQ ID NO. : 4, SEQ ID NO.: 6 and/or homoiogues 
5 hereof mediated sequestration or elicits an immune response in a subject can then be 
administered to this subject. 



Another embodiment of the present invention relates to an isolated antibody or isolated 
antiserum induced in response to one or more polypeptides according to the invention 
10 and/or to one or more nucleic acids as defined in the present application. 

Another embodiment of the present invention relates to an isolated antibody or isolated 
antiserum induced in response to one or more polypeptides according to the invention 
and/or to one or more nucleic acids as defined in the present application, wherein said 
15 antibody is capable of binding to a molecule expressed on the surface of an intact 
erythrocyte infected by a parasite causing malaria. 

Another embodiment of the present invention relates to an isolated antibody or isolated 
antiserum induced in response to one or more polypeptides according to the invention 
20 and/or to one or more nucleic acids as defined in the present application, wherein said 
antibody is capable of recognising paracites selected in vitro for expression of VSA SM . 

A presently preferred embodiment relates to an antibody as described by the present 
application, wherein said antibody is capable of binding to a molecule expressed on the 
25 surface of an intact erythrocyte infected by a parasite capable of mediating cyto-adhesion 
of intact erythrocyte infected by a parasite to human endothelial cells, but not the CD36 
receptor. 



Vaccines 

30 Additional aspects of the present invention relate to vaccines based on any of the 

embodiments of the invention, such as but not limited to a vaccine comprising any of the 
polypeptides, the nucleic acids or the recombinant vectors according to the present 
invention, said vaccine characterised in that it induces an immune response, wherein said 
immune response specifically recognises a molecule expressed on the surface of an intact 

35 erythrocyte infected by a parasites. 

In one embodiment such a vaccine relates to a vaccine comprising one or more B-cell 
and/or T-cell epitopes originating from any of the polypeptides, the nucleic acids or the 
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recombinant vectors of the present invention, said vaccine characterised in that it induces 
an immune response, wherein said immune response specifically recognises a molecule 
expressed on the surface of an intact erythrocyte infected by a parasites. 

5 One presently preferred embodiment of the present invention relates to a polypeptide- 
based vaccine comprises at least one amino acid sequence selected from the group 
consisting of at least one of 

a) SEQ ID NO.: 2, SEQ ID NO. : 4 and/or SEQ ID NO. : 6 and 
10 b) a sequence having at least 80% sequence identity to a); and 

c) sub-sequences of a) or b) with a minimum length of 30 amino acids; and 

d) sub-sequences of a) or b) comprising at least one B-cell epitope. 

Sub-sequences of the polypeptide of the invention used in a vaccine may be any of the 
15 above mentioned amino acid lengths and in addition to these fragments or sub-sequences 
of the polypeptide of the invention, larger polypeptides comprising sub-sequences of the 
invention as part of their sequence, are also embodiments of the present invention. It is 
preferred, however, that these sub-sequences have a minimum length of 30 amino acids 
and that they are at least 80% identical to a region of comparable length within the 
20 sequence of SEQ ID NO. : 2, SEQ ID NO. : 4 and/or SEQ ID NO. : 6. 

In recent years there has been increased focus on nucleic acid vaccines. Other aspects of 
the present invention therefore concern genetic immunisation in which nucleotide based 
vaccines such as vaccines based on DNA molecules or on RNA molecules, which result in 

25 the expression of one or more B-cell and/or T-cell epitopes from a polypeptide encoded by 
a member of the SEQ ID NO.: 1, 3, and/or 5 gene family and/or the codon optimised SEQ 
ID NO.: 7. As for the polypeptide based vaccine this vaccine is characterised in that it 
induces an antibody response wherein said antibody specifically recognises a molecule 
expressed on the surface of an intact erythrocyte infected by VSA SM type parasites that 

30 have been selected for their ability to mediate adhesion to endothelial cells or for increased 
antibody recognition by plasma from young semi-immune children. 

One embodiment of the present invention relates to a nucleotide based vaccine, such as a 
DNA vaccine, which results in the expression of a polypeptide comprising one or more B- 
35 cell and/or T cell epitopes from any of the polypeptide sequences of the present invention, 
wherein said vaccine is capable of inducing an immune respons which specifically 
recognises a molecule expressed on the surface of an intact erythrocyte infected by 
parasites, and wherein said parasites furthermore has a VSA SM phenotype that have been 
selected for antibody recognition or adhesion to endothelial receptors in vitro, and wherein 
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said molecule is recognised by antibodies from young children living in areas of high 
malaria transmission intensity. 

One embodiment relates to a DNA vaccine comprising at least one nucleic acid sequences 
5 to the present invention, wherein said vaccine is capable of inducing an immune response, 
wherein said immune response specifically recognises a molecule expressed on the surface 
of an intact erythrocyte infected by parasites. 

Another embodiment relates to a natural, synthetic or recombinant DNA or RNA vaccine 
10 having any nucleotide sequence according to the present invention, wherein the vaccine is 
capable of eliciting development of ant\-Plasmodium antibodies. 

In a preferred embodiment, the present invention relates to a nucleotide-based vaccine, 
which may be a DNA or RNA vaccine, comprising a vector comprising at least one 
15 nucleotide sequence selected from the group consisting of at least one of 

a) SEQ ID NO.: 1, SEQ ID NO.: 3 SEQ ID NO.: 5 and SEQ ID NO.: 7 or a 
sequence complementary thereof; and 

b) a nucleotide sequence having at least 80% sequence identity to a); and 
20 c) sub-sequences of a) or b) with a minimum length of 30 nucleotides; and 

d) sub-sequences of a) and b) which comprise at least one sequence 
encoding a B-cell epitope. 

The vaccine may thus comprise any of the sub-sequences of the nucleotide sequence of 
25 the invention, which may have any of the sequence identities described above. It is 

preferred, however, that these sub-sequences have a minimum length of 30 nucleotides 
and that they are at least 80% identical to a region of comparable length within the 
sequence of SEQ ID NO.: 1, SEQ ID NO.: 3 SEQ ID NO.: 5 and/or SEQ ID NO.: 7. 

30 According to this aspect of the invention one approach is to incorporate the DNA encoding 
a polypeptide of the invention or parts hereof into a viral or bacterial vector. The following 
organisms, among numerous others, may be employed for this purpose: Coxsackie virus, 
vaccinia virus, Salmonella typhi or Salmonella typhimurium (for oral administration). In 
each case the carrier organism must be acquired by the host cell and the relevant DNA 

35 sequences used for production of the polypeptide of the invention or parts hereof. These in 
turn are recognised as abnormal by the host or recipient and an immune response ensues. 
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Alternatively, the parasite nucleic acid sequence may be incorporated into an RNA virus or 
used to prepare viral replicons. This approach allows for the delivery of coding sequences, 
such as mRNA, to the host cell without risking a replicative, infectious process. 

5 In order to obtain expression of immunogenic polypeptides it is required that elements of a 
nucleotide-based vaccine are capable of entering into the relevant target cells of the 
subject receiving such a vaccine. Therefore, preferred embodiments of the invention 
include vaccines, which further comprise one or more agents and/or vectors to facilitate 
such entry. 

10 

In a further preferred embodiment, the vector component of a nucleotide-based vaccine 
comprises a promoter for driving the expression in a mammalian cell line, a nucleotide 
sequence encoding a leader peptide for facilitating secretion/release of a polypeptide 
sequence from a mammalian ceil, and a terminator. 

15 

The simple concept of a nucleotide-based vaccine is the inoculation of a recipient using the 
relevant DNA sequence alone. This 'naked DNA' approach avoids the administration of 
polypeptide directly, but its effectiveness depends on the ability of the host cell to utilise 
the injected DNA as a template for RNA and subsequent protein synthesis. 

20 

It is anticipated that the principal value of providing SM-specific protective immunity to 
sporozoite-induced infection will be, but not limited to individuals who have not previously 
been exposed to malaria or have only had a few cases of malaria. In the case of severe 
malaria such individuals will be infants and young children in endemic areas, 
25 travellers/tourists, soldiers or individuals from a non-malaria endemic area moving to or 
travelling in endemic areas. 

While not being limited by way of theory it is believed that the protection against malaria 
obtained by the use of a vaccine is most likely a result of IgGs blocking the interaction 
30 between the iRBC and endothelial receptors in various organs of the body such as, but not 
limited to the brain. It is also possible, however, that opsonized erythrocytes are killed by 
macrophages or T-cells, either by fagocytosis or by other means. 

In a preferred embodiment, the vaccine is therefore capable of inducing an IgG response 
35 and, accordingly, it comprises a polypeptide comprising one or more B-cell epitopes. It is 
desirable, however, that polypeptides comprising one or more T-cell epitopes are also part 
of the vaccine since assistance from T-cells may be required in order to obtain a good 
antibody response. 
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In another preferred embodiment of the invention, the vaccine is therefore based on the 
use of polypeptides of the invention wherein said polypeptides comprises one or more B- 
celi epitopes in combination with one or more T-cell epitopes. In a another preferred 
embodiment of the invention, the vaccine comprises B-cell epitopes in combination with T- 
celi epitopes originating from an exogenous molecule, and in an further preferred 
embodiment, the peptides of the vaccine comprises only B-cell epitopes. In equally 
preferred embodiments of the invention, the vaccine is based on nucleotide sequences 
encoding polypeptides, which have the characteristics with respect to antigen epitopes 
described above. 

Techniques exist for enhancing the antigenicity of immunogenic peptides including 
incorporation of these into a multimeric structure, binding to a highly immunogenic protein 
carrier, for example, keyhole limpet hemocyanin, or diptheria toxoid, and administration in 
combination with adjuvants or any other enhancers of immune response. Furthermore, it 
will be understood that polypeptides specific for a plurality of Plasmodium stages and 
species may be incorporated in the same vaccine composition to provide a multivalent 
vaccine. In addition, the vaccine composition may comprise antigens to provide immunity 
against other diseases in addition to malaria. 

Immunogenic polypeptides of the invention as well as nucleic acid molecules encoding such 
polypeptides may be injected as is, or for convenience of administration, it can be added to 
pharmaceutical^ acceptable carriers or diluents. Suitable pharmaceutically acceptable 
carriers will be apparent to those skilled in the art, and include water and other polar 
substances, including lower molecular weight aikanols, polyalkanols such as ethylene 
glycol, polyethylene glycol, and propylene glycol as well as non-polar carriers. 

Routes of administration, antigen dose, number and frequency of injections are all matters 
of optimisation within the scope of ordinary skill in the art, particularly in view of the fact 
that there is already experience in the art of providing protective immunity by the injection 
of irradiated sporozoites. Protective antibodies are usually best elicited by a series of 2 to 3 
doses given about 2 to 3 weeks apart. The series can be repeated when concentrations of 
circulating antibodies in the vaccinee drops. The polypeptide is present in the vaccine in an 
amount sufficient to induce an immune response against the antigenic polypeptide and 
thus to protect against Plasmodium infection thereby protecting the subject against 
malaria. 

Vaccination protocols can include the identification of a subject in need of a vaccine, for 
instance infants and young children living in regions populated with P. falciparum or non- 
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immune individuals e.g. tourist/travellers and soldiers travelling through such regions, and 
administration of one or more effective doses of the vaccine to this subject. 

In some aspects, the present invention can be used to both inhibit the adhesion of iRBC to 
endothelial receptors and to generate an immune response directed at SEQ ID NO. 2, SEQ 
ID NO.: 4 and/or SEQ ID NO. : 6. It is therefore within the scope of the invention to 
provide uses of any of the polypeptides of the present invention as medicaments that are 
therapeutically or prophylacticaliy useful or both. 

Alternatively, such therapeutic and prophylactic effects can be obtained as a result of the 
expression of polypeptides of the invention within a diseased subject or a subject at risk 
for contracting malaria. Therefore, it is also within the scope of the invention to provide 
uses of any of the nucleic acid molecules of the present invention as medicaments that are 
therapeutically or prophylacticaliy useful or both. 

A currently particular preferred embodiment of the present invention relates to a vaccine 
comprising at least one nucleic acid according to the present invention or at least one 
vector according to the present invention, the vaccine effecting in vivo expression of at 
least one antigen by a subject, to whom the vaccine has been administered, the amount of 
expressed antigen being effective to confer substantially increased resistance to malaria 
caused by Plasmodium falciparum. 

Embodied in the invention is also a method for generating a vaccine against malaria 
comprising 

a) injecting a sequence according to any of claims 1-14 and/or 15-28 in a subject 

b) enabling said subject to generate antibodies specifically recognising any of the 
polypeptide sequences according to claim 1-14 

c) purify said antibodies 

d) selecting antibodies having cross-reactivity to parasites causing malaria 

e) selecting antibodies having the ability to inhibit adhesion to endothelial cells. 

Medical use 
Polypeptides 

One presently particular preferred aspect of the present invention relates to an isolated 
polypeptide comprising an amino acid sequence selected from the group consisting of at 
least one of SEQ ID NO.: 2, SEQ IN NO.: 4 and SEQ ID NO.: 6 for use as a medicament. 
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In a presently preferred embodiment the present invention relates to use of any of the 
amino acid sequences according to the present invention capable of mediating cyto- 
adhesion of intact erythrocyte infected by a parasite to human endothelial cells, but not to 
the CD36 receptor. 

5 

Such amino acid sequences can be segregated by an assay known to the skilled person 
comprising Chinese hamster ovary (CHO) cells transfected to express human CD36 and 
cultured by standard methodology to measure iRBC adhesion to these receptors. In brief, 
parasites can be radiolabeled by incubating the cultures overnight in the presence of 3 H- 

10 phenylalanin (1 MBq for a standard culture containing 200 pi packed RBC). Wildtype and 
CD36-transfected CHO cells can be grown to a monolayer in 96-well microtitre plates 
(Nunc, Roskilde, Denmark). Late-stage-enriched iRBC (100 pi, lxlO 7 RBC/ml) can be 
added to the CHO cell monolayer and incubated for one hour at 37°C before unbound iRBC 
can be washed away from the CHO cell monolayer. Finally, the number of CHO-adhering 

15 iRBC could be determined by liquid scintillation spectrometry. 

In another presently preferred embodiment the present invention relates to use of any of 
the amino acid sequences according to the present invention, wherein said amino acid is 
consistently up-regulated after antibody selection-induced change from VSA UM to VSA SM 
20 expression or following selection for adhesion to bone marrow derived endothelial cells. 
Said amino acids do not belong to PfEMPl groups B, C, B/C, varl or var2 as defined by 
Lavstsen et al. 2003, and are further characterised lacking 1-2 cysteine residues in DBLa 
homology group G (Smith et al. 2000) compared to most PfEMPl molecules in groups B 



and C. 



25 



For the present aspects of the invention it applies, as describe by the inventors, that it is 
an object of preferred embodiments of the present invention to provide any polypeptides, 
which are subject to antibody recognition by antibodies in sera from young children living' 
in areas of high malaria transmission intensity and/or capable of mediating cyto-adhesion 
30 of intact erythrocyte infected by a parasite to endothelial cells, but not the CD36 receptor, 
for use as a medicament. Thus in one embodiment, the amino acid sequence may have at 
least 80% sequence identity to SEQ ID NO.: 2, SEQ ID NO.: 4 and SEQ ID NO.: 6. 

It should be understood by the skilled addressee that any feature and/or aspect 
35 discussed above in connection with the sequence identity of the polypeptides 

according to the invention apply by analogy to antibodies according to the invention. 

In another embodiment, the present invention relates to medical uses of an isolated 
polypeptide, wherein the amino acid sequence is a sub-sequence of with a minimum length 
40 of 10 amino acids. 



Again, it should be understood that any lenght of an isolated polypetide described above 
by analogy applies to the present aspect of the invention. 
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A presently preferred embodiment relates to medical use of an isolated polypeptide 
comprising the amino acid sequence shown in SEQ ID NO:2. 

Another presently preferred embodiment relates to medical use of an isolated polypeptide 
5 comprising the amino acid sequence shown in SEQ ID N0:4. 

Another presently preferred embodiment relates to medical use of an isolated polypeptide 
comprising the amino acid sequence shown in SEQ ID NO:6. 

10 A particular presently preferred embodiment relates to medical use of an isolated 
polypeptide consisting of the amino acid sequence shown in SEQ ID NO:2. 

Another particular presently preferred embodiment relates to medical use of an isolated 
polypeptide consisting of the amino acid sequence shown in SEQ ID NO:4. 

15 

Another particular presently preferred embodiment relates to medical use of an isolated 
polypeptide consisting of the amino acid sequence shown in SEQ ID NO:6. 

One embodiment of the present invention relates to medial use of an isolated polypeptide 
20 according to the present invention, wherein the amino acid sequence has at least 80% 
sequence identity to SEQ ID NO:2. 

Another embodiment of the present invention relates to medial use of an isolated 
polypeptide according to the present invention, wherein the amino acid sequence has at 
25 least 80% sequence identity to SEQ ID NO:4. 

Yet another embodiment of the present invention relates to medial use of an isolated 
polypeptide according to the present invention wherein the amino acid sequence has at 
least 80% sequence identity to SEQ ID NO: 6. 

30 

One specially preferred embodiment of the present invention relates to medical use of any 
of the polypeptides described herein, wherein the amino acid sequence is capable of 
mediating cyto-adhesion of intact erythrocyte infected by a parasite to human endothelial 
cells, but not to the CD36 receptor. 

35 

Another specially preferred embodiment of the present invention relates to medical use of 
any of the polypeptides described herein wherein the amino acid is consistently up- 
regulated after antibody selection-induced change from VSA UM to VSA SM expression. 

40 Nucleic acids 

The embodiments of the present invention thus relate to polypeptides of the PfEMPl class 
or sub-sequences hereof as well as nucleic acid molecules encoding such polypeptides or 
sub-sequences, thus one aspect of the present invention relates to medical uses of an 
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isolated nucleic acid comprising a nucleotide sequence selected from the group consisting 
of at least one of SEQ ID NO.: 1, SEQ ID NO.: 3 SEQ ID NO.: 5 and SEQ ID NO.: 7 for use 
as a medicament. 

5 As described by the present inventors the aim of the present invention is to provide a 
nucleic acid sequence encoding a polypeptide which is capable of mediating cyto-adhesion 
of intact erythrocyte infected by a parasite to human endothelial cells and/or provide a 
nucleic acid sequence which is consistently upregulated after antibody selection-induced 
change from VSA UM to VSA SM expression. 

10 

This one embodiment relates to medical use of a nucleic acid, wherein the nucleotide 
sequence has at least 80% sequence identity to SEQ ID NO.: 1, SEQ ID NO. : 3 SEQ ID 
NO.: 5 and/or SEQ ID NO.: 7. 

15 Another presently preferred embodiment relates to medical use of a nucleic acid wherein 
the nucleotide sequence is a sub-sequence of with a minimum length of 30 nuclotides. 

All the features described herein relating to the nucleic acid molecules of the present 
invention are also applicable as embodiments relating to the medical use of said nucleic 
20 acids, and vice versa. 

Another presently preferred embodiment relates to medical use of a nucleic acid, wherein 
the nucleic acid comprises the nucleotide sequence shown in SEQ ID NO:l. 

25 Another presently preferred embodiment relates to medical use of a nucleic acid, wherein 
the nucleic acid consists of the nucleotide sequence shown in SEQ ID NO:l. 

Another presently preferred embodiment relates to medical use of a nucleic acid, wherein 
the nucleic acid comprises the nucleotide sequence shown in SEQ ID NO:3. 

30 

Another presently preferred embodiment relates to medical use of a nucleic acid, wherein 
the nucleic acid consists of the nucleotide sequence shown in SEQ ID NO:3. 

Another presently preferred embodiment relates to medical use of a nucleic acid, wherein 
35 the nucleic acid comprises the nucleotide sequence shown in SEQ ID NO: 5. 

Another presently preferred embodiment relates to medical use of a nucleic acid, wherein 
the nucleic acid consists of the nucleotide sequence shown in SEQ ID NO:5. 

40 Another presently preferred embodiment relates to medical use of a nucleic acid, wherein 
the nucleic acid comprises the nucleotide sequence shown in SEQ ID NO:7. 

Another presently preferred embodiment relates to medical use of a nucleic acid, wherein 
the nucleic acid consists of the nucleotide sequence shown in SEQ ID NO: 7. 
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Another presently preferred embodiment relates to medical use of a nucleic acid, wherein 
the nucleotide sequence has at least 80% sequence identity to SEQ ID IMO:l. 

5 

Another presently preferred embodiment relates to medical use of a nucleic acid, wherein 
the nucleotide sequence has at least 80% sequence identity to SEQ ID NO:3. 

Another presently preferred embodiment relates to medical use of a nucleic acid, wherein 
10 the nucleotide sequence has at least 80% sequence identity to SEQ ID NO: 5. 

Another presently preferred embodiment relates to medical use of a nucleic acid, wherein 
the nucleotide sequence has at least 80% sequence identity to SEQ ID NO:7. 

15 Another presently preferred embodiment relates to medical use of a nucleic acid, wherein 
the sequence is consistently upregulated after antibody selection-induced change from 
VSA UM to VSA SM expression. 

Another presently preferred embodiment relates to medical use of a nucleic acid, wherein 
20 the nucleic acid sequence encodes a polypeptide, which is capable of mediating cyto- 
adhesion of intact erythrocyte infected by a parasite to human endothelial cells, but not 
the CD36 receptor. 

Another presently preferred embodiment relates to medical use of a nucleic acid, wherein 
25 the nucleic acid sequence encodes a polypeptide, which is recognised by plasma from 
young semi-immune children. 

In a further preferred embodiment nucleic acid sequence is a re-codonised sequence such 
as SEQ ID NO.: 7. Particularly preferred are sequences that are recodonised in order to 
30 enhance or optimise expression of the resulting protein or polypeptide in a given 
expression system. 

Vector 

Another aspect of the present invention relates to a recombinant vector comprising any of 
35 the nucleic acids defined in the present application operably linked to one or more control 
sequences for use as a medicament. 

Further medical uses 

In another aspect, the present invention relates to use of any of the polypeptides 
40 according to the present invention for the manufacture of a composition to be administered 
in order to prophylacticaily or therapeutically reduce the incidence, prevalence or severity 
of malaria in a subject. 
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Thus, in one embodiment, the present invention relates to use of any of the polypeptides 
according to the present invention for the manufacture of a vaccine for prophylac of 
malaria. 

5 In another embodiment the present invention relates to use of any of the polypeptides 
according to the present invention for the manufacture of a composition for vaccination 
against malaria. 

As will be apparent, the present invention further relates to use of any nucleic acid 
10 according to the present invention for the manufacture of an composition to be 
administered in order to prophylactically or therapeutically reduce the incidence, 
prevalence or severity of malaria in a subject. 

In one embodiment, the present invention relates to use of any nucleic acid according to 
15 the present invention for the manufacture of a vaccine for malaria prophylaxis. 

In another embodiment the present invention relates to use of any nucleic acid according 
to the present invention for the manufacture of a composition for vaccination against 
malaria. 

20 

As the skilled adressee would recognise the present invention also relates to use of a 
recombinant vector according to the present invention for the manufacture of a 
composition to be administered in order to prophylactically or therapeutically reduce the 
incidence, prevalence or severity of malaria in a subject. 

25 

In one embodiment the present invention relates to use of a recombinant vector according 
to the present invention for the manufacture of a vaccine for prophylactic treatment of 
malaria. 

30 In another embodiment the present invention relates to use of a recombinant vector 
according to the present invention for the manufacture of a composition for vaccination 
against malaria. 

In a presently preferred embodiment of the present invention said malaria is caused by 
35 Plasmodium falciparum. 



Treatment 
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It should be understood that any feature and/or aspect discussed above in connection 
with the uses according to the invention apply by analogy to methods of treatment 
according to the invention. 

5 Thus, is one aspect the present invention relates to a method for prophylactically or 
therapeutically reduce the incidence, prevalence or severity of malaria in an subject said 
method comprising administering to said subject an effective amount of a polypeptide, a 
nucleic acid or a recombinant vector according to the present invention. 

10 In one embodiment the present invention relates to a method for the prophylactic 

treatment of malaria in a subject, said method comprises administering to said subject an 
effective amount of a polypeptide, a nucleic or a recombinant vector according to the 
present invention. 

15 In another embodiment the present invention relates to a vaccination method against 
malaria in a subject, said vaccination method comprising administering to said subject an 
effective amount of a polypeptide, a nucleic acid or a recombinant vector according to the 
present invention. 

20 Composition 

Another aspect of the present invention relates to use of the polypeptides and/or the 
nucleic acids as defined by the present application for the preparations of a composition. 
Thus in one embodiment the invention relates to a composition comprising a polypeptide 
according to the invention or a nucleic acid according to the invention and a 
25 pharmaceutical^/ acceptable diluent, carrier or adjuvant. 

In a presently preferred embodiment the composition is an immunogenic composition. 

In a presently most preferred embodiment the composition induces an IgG/IgM antibody 
30 response. 



Production of pharmaceuticals 

Another aspect of the present invention may be the production of pharmaceuticals based 
on polypeptides of the invention or sub-sequences hereof or nucleic acid sequences 
35 encoding such molecules, as described above. Such pharmaceuticals may aiso include 

agents such as but not limited to other polypeptides and in particular antibodies, which are 
capable of modulating and/or inhibit the adhesion of SEQ ID NO.: 2, SEQ ID NO.:4, SEQ ID 
NO.: 6 and/or homologous hereof to endothelial receptors. 
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Accordingly, it is within the scope of the invention to provide the use of any of the amino 
acid sequences according to the present invention for the manufacture of a composition, 
such as an immunogenic composition, which is to be administered in order to 
prophylactically or therapeutically reduce the incidence, prevalence or seventy of severe 
5 malaria (SM) in an individual non-immune to SM causing parasites. 

In addition, the invention also relates to the use of a nucleic acid molecule according to the 
present invention for the manufacture of a composition, such as an immunogenic 
composition, which is to be administered in order to prophylactically or therapeutically 
10 reduce the incidence, prevalence or severity of severe maiaria in an individual non-immune 
to SM causing parasites. 

Delivery of these pharmaceuticals can be performed by any conventional route including, 
but not limited to, transdermal, parenteral, gastrointestinal, transbronchial, and 
15 transaiveolar administration. 

Biotechnological tools 

The use of the nucleic acid and polypeptide-based embodiments of the present invention 
can also extend to their use as biotechnological tools and as components of diagnostic 
assays. 

20 

Thus, one embodiment relates to an in vitro diagnostic method, said method comprising 
contacting a sample with any of the polypeptides according to the present invention under 
conditions allowing an in vitro immunological reaction to occur between said polypeptide 
and the antibodies possibly present in said sample, and in vitro detect the antigen- 
25 antibody complexes possibly formed. 

In a presently preferred embodiment said diagnostic assay, further relates to an in vitro 
diagnostic method, wherein a disease-state profile for a tested subject is generated by 
determining the concentration or expression level in a sample of any of the polypeptide 
30 and/or nucleic acid sequences as defined in the present application. 

Additional embodiments of the invention therefore include an in vitro diagnostic method, 
which comprises contacting a sample such as a tissue or biological fluid with a polypeptide 
comprising a sequence selected from the group consisting of at least one of 

35 

a) SEQ ID NO.: 2, SEQ ID NO.: 4, and SEQ ID NO.: 6; and 

b) a sequence having at least 80% sequence identity to a); and 

c) sub-sequences of a) or b) with a minimum length of 30 amino acids; and 

d) sub-sequences of a) or b) comprising at least one B-cel! epitope 
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under conditions allowing an in vitro immunological reaction to occur between said 
polypeptide composition and the antibodies possibly present in the biological sample, and 
the in vitro detection of the antigen-antibody complexes possibly formed. In one preferred 
5 embodiment the polypeptide is immobilised on a solid support. 

Other embodiments include an in vitro diagnostic method, which comprises contacting a 
sample such as a tissue or biological fluid with a nucleotide composition comprising a 
sequence selected from the group consisting of at least one of 

10 

a) SEQ ID NO.: 1, SEQ ID NO.: 3, and SEQ ID NO.: 5 or a sequence 
complementary thereof; and 

b) a nucleotide sequence having at least 80% sequence identity to a); and 

15 

c) sub-sequences of a) or b) with a minimum length of 18 nucleotides 

under conditions allowing an in vitro reaction to occur between said nucleotide composition 
and e.g. primers and probes. 

20 

In some aspects, the nucleic acid embodiments are employed as nucleic acid probes in 
hybridisation assays, in cloning, or as primers for polymerase chain reaction (PCR). 
Similarly, the polypeptide-based embodiments can be used as components of 
immunological reactions such as ELISA, radioimmunoassays (RIA) and adhesion-blocking 
25 assays. The scope of such work can be, for example, to characterise SEQ ID NO.: 2, SEQ 
ID NO. : 4, and/or SEQ ID NO. : 6 or regions of SEQ ID NO. : 2, SEQ ID NO. : 4, and/or SEQ 
ID NO.: 6 involved in interaction with endothelial receptors or antibodies 

In some diagnostic embodiments, nucleic acids complementary to the nucleic acid 
30 molecules of the invention or fragments hereof are used to identify SEQ ID NO.: 1, SEQ ID 
NO.: 3, and/or SEQ ID NO.: 5 nucleic acids (e.g. mRNA) present in a biological sample, for 
instance a tissue sample or a sample of body fluid such as blood or serum. In a preferred 
diagnostic embodiment, nucleic acid molecules complementary to fragments of SEQ ID 
NO.: 1, SEQ ID NO.: 3, and/or SEQ ID NO.: 5 comprising sequences, which are not found 
35 in nucleic acids encoding other SEQ ID NO.: 2, SEQ ID NO.: 4, and/ SEQ ID NO.: 6 

proteins, are used to identify SEQ ID NO.: 1, SEQ ID NO.: 3, and/or SEQ ID NO.: 5 nucleic 
acids (e.g. mRNA) present in a biological sample. 

In other diagnostic embodiments, nucleic acids identical to the nucleic acid molecules of 
40 the invention or fragments hereof are used to identify SEQ ID NO.: 1, SEQ ID NO.: 3, 
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and/or SEQ ID NO. : 5 nucleic acids (e.g. mRNA) present in a biological sample, for 
instance a tissue sample or a sample of body fluid such as blood or serum. In a preferred 
diagnostic embodiment, nucleic acid molecules identical to fragments of SEQ ID NO. : 1, 
SEQ ID NO.: 3, and/or SEQ ID NO.: 5 comprising sequences, which are not found in 
5 nucleic acids encoding other SEQ ID NO.: 2, SEQ ID NO.: 4, and/ SEQ ID NO.: 6 proteins, 
are used to identify SEQ ID NO.: 1, SEQ ID NO.: 3, and/or SEQ ID NO.: 5 nucleic acids 
(e.g. mRNA) present in a biological sample. 

The concentration or transcription level in the infected subject of SEQ ID NO.: 1, 3, and/or 
10 5 nucleic acids or other nucleic acids, which encode proteins that can mediate adhesion to 
endothelial cells will differ depending on the type of Plasmodium infection. Thus, some 
Plasmodium parasites will only cause the expression of low amounts of SEQ ID NO.: 2, 
SEQ ID NO.: 4, and/ or SEQ ID NO.: 6 or no expression at all. Likewise it will not be 
possible to detect any expression of SEQ ID NO.: 2, SEQ ID NO.: 4, and/or SEQ ID NO.: 6 
15 in subjects that are not carrying a Plasmodium infection. Accordingly, malaria and, more 
specifically, severe malaria can be diagnosed by determining the concentration of SEQ ID 
NO.: 1, 3, and/or 5 gene transcripts in an individual at risk of contracting this disease. In 
the case of severe malaria such individuals may be e.g. infants or young children who live 
in endemic areas, and previously unexposed individuals travelling into endemic areas. 

20 

One embodiment of the present invention is therefore an in vitro diagnostic method 
whereby infection with Plasmodium and more specifically infection with P. falciparum can 
be detected. In a preferred embodiment, a disease state profile can be created by 
collecting data on the transcription level of SEQ ID NO.: 1, 3, and/or 5 in a large number 

25 of infected subjects and subsequent using these sets of data as reference. The 

concentration or transcription level of SEQ ID NO.: 1, 3, and/or 5 detected in a tested 
subject can then be compared to this reference material so as to predict or follow the 
disease-state of that particular individual. Thus, in some embodiments the term " SEQ ID 
NO.: 1, 3, and/or 5 disease-state profile" refers to the concentration or transcription level 

30 or concentration range or transcription level range of a nucleic acid sequence encoding 
SEQ ID NO. : 2, 4, and/or 6 or a part hereof that is detected in a biological sample. Arrays 
comprising nucleotide probes comprised by the nucleotide sequence of the invention or 
fragments hereof or real-time quantitative PCR can be used to create such disease-state 
profiles. 

35 

In a similar fashion to that discussed above, a SEQ ID NO.: 2, 4, and/or 6 disease-state 
profile comprising concentration levels or concentration range levels of SEQ ID NO.: 2, 
SEQ ID NO. : 4, and/or SEQ ID NO.: 6 amino acid sequences in healthy and diseased 
subjects can be created and used to follow the disease-state of an individual. Accordingly, 
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in some embodiments the term " SEQ ID NO.: 2, SEQ ID NO.: 4, and/or SEQ ID NO.: 6 
disease-state profile 11 refers to the concentration or concentration range or the expression 
level or expression level range of a polypeptide corresponding to SEQ ID NO.: 2, SEQ ID 
NO.: 4, and/or SEQ ID NO.: 6 or a part hereof in a biological sample. Preferred methods 
5 for detecting such proteins or polypeptides include radioactive or non-radioactive immune- 
based approaches such as ELISA or radio-immunoassays as well as standard membrane- 
blotting techniques. 

The invention also relates to a method for the in vitro detection of antibodies, which 

10 correlate with malaria originating from the infection of an individual P. falciparum in a 
tissue or biological fluid likely to contain such antibodies. This procedure comprises 
contacting a biological fluid or tissue sample as defined above with a preparation of 
antigens comprising the polypeptide of the invention or any part hereof under conditions, 
which allow an in vitro immunological reaction to occur between these antigens and the 

15 antibodies possibly present in the tissue or fluid. It further comprises the in vitro detection 
of the antigen-antibody complexes possibly formed by the use of conventional techniques. 
As an example, a preferred method involves the use of techniques such as ELISA, as well 
as immuno-fluorescent or radio-immunological assays (RIA) or equivalent procedures. 
Again, such techniques can be used for collecting data on the concentration of antibodies 

20 against the polypeptide of the invention or parts hereof in subjects infected with 
Plasmodium parasites. These data can serve as reference when compared to the 
concentration of antibodies against the polypeptide of the invention detected in a given 
subject and a disease-state profile can be generated on the basis hereof. Thus, in some 
embodiments the term " SEQ ID NO.: 2, SEQ ID NO.: 4, and/or SEQ ID NO.: 6 disease- 

25 state profile" refers to the concentration or concentration range of SEQ ID NO.: 2, SEQ ID 
NO.: 4, and/or SEQ ID NO.: 6 antibodies, which are detected in a biological sample. 

Further, some aspects of the invention relate to the process of identifying compounds or 
compositions, which can be employed in the therapeutic treatment or prophylaxis of 
30 malaria. This may for instance be a method for identifying agents capable of modifying the 
SEQ ID NO.: 2, SEQ ID NO.: 4, and/or SEQ ID NO.: 6 dependent adhesion to human 
endothelial cells. When contacted with the agent(s) of interest, the adhesion of the iRBC to 
human endothelial cells is avoided. Alternatively, interaction of the agent(s) with the 
expressed polypeptides is avoided. 

35 Kits 

In other aspects of the invention, kits are provided which will simplify the use of the 
polypeptide and nucleotide embodiments of the invention for in vitro diagnostic purposes. 
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Such an in vitro diagnostic kit may comprise 

a) any polypeptide and/or nucleic acid sequence as defined in the present 
5 application, 

b) reagents for preparing a suitable medium for carrying out an 
immunological reaction between an antibody present in a sample of body 
fluid or tissue and said sequence; and 

c) reagents allowing the detection of the antigen-antibody complexes formed, 
10 wherein said reagents may bear a radioactive or non-radioactive label. 

In addition to this component, the kit may comprise reagents for preparing a suitable 
medium for carrying out an immunological reaction between an antibody present in a 
sample of body fluid and said sequence; and reagents allowing the detection of the 
15 antigen-antibody complexes formed, wherein said reagents may bear a radioactive or non- 
radioactive label. 

Such an in vitro diagnostic kit may also comprise a cDNA chip with probes that have 
nucleotide sequences complementary to cDNA of the PfEMPl described in the present 

20 application; primers for amplifying DNA obtained from clinical samples by PGR; and, means 
for labeling amplified DNA hybridized with the probes of the said cDNA chip. The cDNA chip 
may further comprise position markers to locate probes, and staining or labeling is 
performed by using, means for labeling comprising preferably biotin-binding material, most 
preferably, streptavidin-R-phycoerythrin which is a conjugate of a fluorophore and a 

25 protein with biotin-binding sites. The process for preparing cDNA chip contained in the kit 
comprises the steps of: preparing 5' terminal amine-linked DNA probes which have 
nucleotide sequences complementary to cDNA of the PfEMPl described in the present 
application; affixing the DNA probes thus prepared to an aldehyde-derivatized solid 
surface; and, reducing excessive aldehydes not reacted with amine. 

30 

According to the present invention, Plasmodium falciparum in a cell or tissue can be 
detected by quantitatively measuring the level of one or more nucleic acids of the present 
invention using, e.g., real time polymerase chain reaction (PGR) with at least one 
oligonucleotide primer pair or oligonucleotide specific probe being capable of distinguishing 
35 between the PfEMPl described by the present application and other polypeptides. In 

general, a quantitatively measured level in a biological sample from a subject, e.g., human 
of any of the nucleic acids described in the present application that is higher than the 
quantitatively measured level in a biological sample from a normal subject is indicative of 
malaria in the subject. 

40 
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The kit of the invention is an implement that can detect malaria infection in a simple and 
accurate manner, as well as identify the types of infecting paracites, therefore, it may 
contribute to early diagnosis, prevention and treatment of malaria. 

5 Alternatively, the in vitro diagnostic kit may comprise antibodies which specifically 

recognise a sequence selected from the group consisting of at least one of a) SEQ ID NO.: 
2, SEQ ID NO. : 4, and SEQ ID NO.: 6; and b) a sequence having at least 80% sequence 
identity to a); and c) sub-sequences of a) or b) with a minimum length of 30 amino acids; 
and d) sub-sequences of a) or b) comprising at least one B-cell epitope as well as reagents 
10 for preparing a suitable medium for carrying out an immunological reaction between said 
antibody and a sequence possibly present in a sample of body fluid or tissue and reagents 
allowing the detection of the antigen-antibody complexes formed. Said agents or said 
antibodies may optionally bear a radioactive or non-radioactive label. 

15 A variety of assays can be utilized in order to detect antibodies that specifically bind to the 
desired polypeptide. Exemplary assays are described in detail in Antibodies: A Laboratory 
Manual, Harlow and Lane (eds.), Cold Spring Harbor Laboratory Press, 1988. 

Representative examples of such assays include: countercurrent immuno-electrophoresis 
20 (CIEP), radioimmunoassays, radioimmunoprecipitations, enzyme-linked immuno-sorbent 
assays (ELISA), dot blot assays, inhibition or competition assays, and sandwich assays, 
immunostick (dipstick) assays, simultaneous immunoassays, immunochromatographic 
assays, immunofiltration assays, latex bead agglutination assays, immunofluorescent 
assays, biosensor assays, and low-light detection assays. 

25 

Still another aspect of the present invention provides antibodies, as discussed above, for 
detecting Plasmodium falciparum in diagnostic tests. Such antibodies are useful in a wide 
variety of antibody-based assays. As discussed above, exemplary assays are described in 
detail in Antibodies: A Laboratory Manual, (supra); U.S. Pat. No. 4,736,110 and U.S. Pat. 
30 No. 4,486,530. 

These antibody-based diagnostic tests include but are not limited to the following tests: 

A fluorescent antibody test (FA-test) uses a fluorescently-labeled antibody able to bind to 
35 one of the polypeptides of the invention. For detection, visual determinations are made by 
a technician using fluorescence microscopy, yielding a qualitative result. In a preferred 
embodiment, this assay is used for the examination of samples such as but not limited to 
blood samples. 

40 FACS assay using fluorescently-labeled antibodies to bind one of the polypetides of the 
present invention expressed on the surface of RBC, yielding a quantitative result. In a 
preferred embodiment, this assay is used for the examination of samples such as but not 
limited to blood samples. 
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In latex bead agglutination assays, antibodies to one or more of the proteins of the present 
invention are conjugated to latex beads. The antibodies conjugated to the latex beads are 
then contacted with a sample under conditions permitting antibodies to bind to desired 
5 proteins in the sample, if any. The results are then read visually, yielding a qualitative 
result. In a preferred embodiment, this format can be used in the field for on-site testing. 

Enzyme immunoassays (EIA) include a number of different assays able to utilize the 
antibodies provided by the present invention. For example, a heterogeneous indirect EIA 

10 uses a solid phase coupled with an antibody of the invention and an affinity purified, anth 
IgG immunoglobulin preparation. Preferably, the solid phase is a polystyrene microtiter 
plate. The antibodies and immunoglobulin preparation are then contacted with the sample 
under conditions permitting antibody binding, which conditions are well known in the art. 
The results of such an assay can be read visually, but are preferably read using a 

15 spectrophotometer, such as an ELISA plate reader, to yield a quantitative result. 

An alternative solid phase EIA format includes a plastic-coated ferrous metal beads able to 
be moved during the procedures of the assay by means of a magnet. Yet another 
alternative is a low-light detection immunoassay format. In this highly sensitive format, 
20 the fight emission produced by appropriately labeled bound antibodies are quantitated 
automatically. Preferably, the reaction is performed using microtiter plates. 

In a capture-antibody sandwich enzyme assay, the desired protein is bound between an 
antibody attached to a solid phase, preferably a polystyrene microtiter plate, and a labeled 
25 antibody. Preferably, the results are measured using a spectrophotometer, such as an 
ELISA plate reader. 

In an alternative embodiment, a radioactive tracer is substituted for the enzyme mediated 
detection in an EIA to produce a radioimmunoassay (RIA). 

30 

In a sequential assay format, reagents are allowed to incubate with the capture antibody in 
a stepwise fashion. The test sample is first incubated with the capture antibody. Following 
a wash step, incubation with the labeled antibody occurs. In a simultaneous assay, the two 
incubation periods described in the sequential assay are combined. This eliminates one 
35 incubation period plus a wash step. 

A dipstick/immunostick format is essentially an immunoassay except that the solid phase, 
instead of being a polystyrene microtiter plate, is a polystyrene paddle or dipstick. 
Reagents are the same and the format can either be simultaneous or sequential. 

40 

In a chromatographic strip test format, a capture antibody and a labeled antibody are 
dried onto a chromatographic strip, which is typically nitrocellulose or nylon of high 
porosity bonded to cellulose acetate. The capture antibody is usually spray dried as a line 
at one end of the strip. At this end there is an absorbent material that is in contact with 
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the strip. At the other end of the strip the labeled antibody is deposited in a manner that 
prevents it from being absorbed into the membrane. Usually, the label attached to the 
antibody is a latex bead or colloidal gold. The assay may be initiated by applying the 
sample immediately in front of the labeled antibody. 

5 

Immunofiltration/immunoconcentration formats combine a large solid phase surface with 
directional flow of sample/reagents, which concentrates and accelerates the binding of 
antigen to antibody. In a preferred format, the test sample is preincubated with a labeled 
antibody then applied to a solid phase such as fiber filters or nitrocellulose membranes or 
10 the like. The solid phase can also be precoated with latex or glass beads coated with 

capture antibody. Detection of analyte is the same as standard immunoassay. The flow of 
sample/reagents can be modulated by either vacuum or the wicking action of an 
underlying absorbent material. 

15 A threshold biosensor assay is a sensitive, instrumented assay amenable to screening 
large number of samples at low cost. In one embodiment, such an assay comprises the 
use of light addressable potentiometric sensors wherein the reaction involves the detection 
of a pH change due to binding of the desired protein by capture antibodies, bridging 
antibodies and urease-conjugated antibodies. Upon binding, a pH change is effected that is 

20 measurable by translation into electrical potential (pvolts). The assay typically occurs in a 
very small reaction volume, and is very sensitive. Moreover, the reported detection limit of 
the assay is 1,000 molecules of urease per minute. 

For diagnostic methods, which are based on detecting the presence of polypeptides of the 
25 invention sub-sequences with a low degree of sequence identity to polypeptides, which are 
unrelated to SEQ ID NO.: 2, SEQ ID NO. : 4, and/or SEQ ID NO.: 6 are preferred. 

In a preferred embodiment, the kit comprises a solid support to which the antibodies of the 
kit are coupled. Such a support may for instance comprise an organic polymer. 

30 

In an additional embodiment, the kit comprises one or more doses of a vaccine in addition 
to the diagnostic components as described above. It is contemplated that such a kit may 
simplify the process of identifying and treating subjects in need of one of the therapeutic 
or prophylactic embodiments of the invention. Furthermore, the diagnostic components of 
35 a kit may be used to determine the presence of antibodies and thereby the efficiency of 
the vaccine in each individual subject. 

In certain embodiments a kit comprises preparations of the polypeptide and/or nucleotide 
embodiments of the invention filled in a number of separate containers. The containers can 
40 be entirely separate or can be constituted by separate chambers of the same applicator 
device. Where the containers are separate, they could be provided in the form of a kit 
comprising separate dispensers or syringes. Where the containers form part of the same 
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applicator, they could for example, be defined by separate barrels of a multi-barrel 
syringe. A kit may thus comprise containers and/or barrels, where one container or barrel 
contains an immunogenic substance and another container or barrel contains a diluent 
and/or a carrier and/or an adjuvant. Other containers or barrels may contain diagnostic 
5 components. 

Novel agents 

Within the scope of the present invention are also methods for identifying and/or designing 
novel agents useful in the prevention or treatment of malaria. Embodied in the invention is 
10 therefore a method for testing an inhibitor-molecule capable of inhibiting binding of any of 
the polypeptides according to the present invention to a receptor expressed on endothelial 
cells comprising 

a) in vitro cultures of endothelial cells 

b) add potential inhibiting-molecule 

15 c) add RBC infected with parasites, said iRBC expressing any of said 

polypeptide sequences on their surface of the RBC 
d) measure the binding of the iRCB with said endothelial cells by 
microscopy or other means of quantifying binding as for instance liquid 
scintillation spectrometry. 

20 

Embodied in the invention is also a method for identifying an agent, which is capable of 
disrupting the Plasmodium life cycle, and an agent, which specifically modulates SEQ ID 
NO. : 2, SEQ ID NO. : 4, and/or SEQ ID NO. : 6 dependent adhesion to endothelial 
receptors, the method comprising providing a ceil expressing an amino acid sequence 
25 selected from the group consisting of at least one of 

a) SEQ ID NO.: 2, SEQ ID NO.: 4, and SEQ ID NO.: 6; and 

b) a sequence having at least 80% sequence identity to a); and 

c) sub-sequences of a) or b) with a minimum length of 30 amino acids; and 
30 d) sub-sequences of a) or b) comprising at least one B-cell epitope 

and contacting said cell with the agent and detecting adhesion of said cell to endothelial 
receptors. 

35 By this approach, an agent, which inhibits adhesion of a polypeptide of the invention to 
endothelial receptors, can be identified by contacting endothelial receptors with 
polypeptides of the invention or sub-sequences thereof in the presence of the agent. 
Detection is accomplished and successful agents identified - according to their ability to 
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induce a desired modulation of the formation of complexes of endothelial receptors and 
polypeptides of the invention. 

In a preferred embodiment, this method is based on the detection of cells, which adhere to 
endothelial cells on a solid support. Again, such a support may for instance comprise a 
resin, a membrane, an organic polymer, a lipid or a cell or part thereof. According to 
another aspect of the invention a support comprising a polypeptide of the invention or a 
fragment thereof coupled to it can be used to capture endothelial receptors and thereby 
identify substances that are capable of modulating the interaction of endothelial receptors 
and a polypeptide of the invention. The method may be based on directly or indirectly 
labelled endothelial receptors or a labelled polypeptide of the invention as well as the 
labelling of whole cells using radioactive as well as non-radioactive techniques. 

Another possibility of using the polypeptide embodiments of the present invention is the 
development of a method for identifying an agent, which interacts with an amino acid 
sequence selected from the group consisting of at least one of 

a) SEQ ID NO.: 2, SEQ ID NO.: 4, and SEQ ID NO.: 6; and 

b) a sequence having at least 80% sequence identity to a); and 

c) sub-sequences of a) or b) with a minimum length of 30 amino acids; and 

d) sub-sequences of a) or b) comprising at least one B-cell epitope; 

said method comprising providing a cell expressing one or more of said polypeptides; 
contacting said cell with the agent; and detecting the interaction of the agent with one or 
more of the said polypeptides. 

In preferred embodiments, the agents identified by the use of these methods are 
monoclonal or polyclonal antibodies. 

In addition, these methods can be used to identify compounds that will induce a desired 
immune response in a subject or patient and thereby serve as valuable tools in the 
development of novel medical compositions as for instance vaccines. Therefore, in a 
preferred embodiment of the Invention, the methods described above are used for 
identifying polypeptides, which will induce a specific antibody response upon 
administration to a subject in need hereof, or nucleotide sequences encoding such amino 
acid sequences. Use of the methods for this purpose comprises injecting into a living 
organism one or more of the polypeptides defined above, contacting a tissue or a biological 
fluid sample from said organism with said polypeptides; allowing an in vitro reaction to 
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occur between the polypeptides and antibodies possibly present in the biological tissue; 
and the in vitro detection of complexes possibly formed. 

An additional preferred embodiment is a method as described above wherein said tissue or 
5 said biological fluid sample is contacted with polypeptides expressed on the surface of a 
cell. 

An equally preferred embodiment is a method as described above wherein said tissue or 
said biological fluid sample is contacted with polypeptides expressed on the surface of 
10 erythrocytes selected for adhesion to endothelial cells or for increased antibody 
recognition. 

Finally, another preferred embodiment of the invention is a method as described above 
wherein said tissue or biological fluid sample is contacted with polypeptides immobilised on 
15 a solid support. 

In other embodiments, protein models of the polypeptides of the invention are constructed 
by the use of conventional techniques within molecular biology. Agents that interact with 
polypeptides of the invention are constructed and approaches in combinatorial chemistry 

20 are employed in the development of agents that modulate SEQ ID NO.: 2, SEQ ID NO.: 4, 
and/or SEQ ID NO.: 6 mediated interaction with endothelial receptors or are able to induce 
an immune response. Accordingly, novel agents that interact with SEQ ID NO.: 2, SEQ ID 
NO.: 4, and/or SEQ ID NO.: 6 are developed, screened in a SEQ ID NO.: 2, SEQ ID NO.: 4, 
and/or SEQ ID NO.: 6 characterisation assay, for instance a SEQ ID NO.: 2, SEQ ID NO.: 

25 4, and/or SEQ ID NO.: 6 anti-adhesion assay as described above. The identity of each 
agent and its performance in the SEQ ID NO.: 2, SEQ ID NO.: 4, and/or SEQ ID NO.: 6 
characterisation assay, its effect on the modulation of SEQ ID NO.: 2, SEQ ID NO.: 4, 
and/or SEQ ID NO.: 6-mediated adhesion to endothelial cells or its ability to induce an 
immune response is recorded on electronic or non-electronic media. These recorded data 

30 can serve as the basis for a library of SEQ ID NO. : 2, SEQ ID NO.: 4, and/or SEQ ID NO.: 
6 modulating agents. Such a library can again be employed to further identify agents that 
modulate SEQ ID NO.: 2, SEQ ID NO.: 4, and/or SEQ ID NO.: 6-mediated adhesion to 
endothelial cells and can be valuable tools for selecting an appropriate pharmaceutical to 
treat a particular type of infection with Plasmodium. It is further expected that the high 

35 throughput screening techniques currently in use within the biotech and pharmaceutical 
industries can readily be applied to the procedures outlined above. 
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Host cells 

With respect to the above embodiments, the invention further relates to host cells 
comprising the above-described nucleic acid molecules. The nucleic acid molecules may be 
transformed, stably transfected or transiently transfected into the host cell or infected into 
5 the host ceil by a live attenuated virus. The preferred host cells may include, but are not 
limited to, prokaryotic cells, such as Escherichia coli, Staphylococcus aureus, and 
eukaryotic cells, such as Sacchromyces cerevisiae and Pichia pastoris, CHO and COS cells 
as well as Baculovirus infected hi-five or sf9 insect cells. Transformation with the 
recombinant molecules can be effected using methods well known in the art. 

10 

It should be understood that any feature and/or aspect discussed above in connection with 
the use according to the invention apply by analogy to methods of treatment or prevention 
of malaria according to the invention. 

15 It should be understood that in a particular preferred embodiment of all the ascpects of the 
present invention, the malaria is severe malaria. 

As will be apparent, preferred features and characteristics of one aspect of the invention 
may be applicable to other aspects of the invention. 

20 

Throughout this specification the word "comprise", or variations such as "comprises" or 
"comprising", will be understood to imply the inclusion of a stated element, integer or step, 
or group of elements, integers or steps, but not the exclusion of any other element, 
integer or step, or group of elements, integers or steps. 

25 

All patent and non-patent references cited in the present application, are hereby 
incorporated by reference in their entirety. 

The invention will hereinafter be described by way of the following non-limiting Definitions, 
30 Figures and Examples. 

Definitions 

"Vaccine" refers to a preparation of SEQ ID NO. : 1, and/or SEQ ID NO. : 2, and/or SEQ ID 
NO.: 3, SEQ and/or ID NO.: 4, and/or SEQ ID NO.: 5, and/or SEQ ID NO.: 6 and/or the 
35 codon optimised SEQ ID NO.: 7, which can induce protective immunity against severe 
malaria, but which does not itself cause disease. 



40 



"Medicament" relates to any composition comprising any of the polypeptides and/or nucleic 
acids describe herein for treatment of malaria and/or preventition of initiation of malaria 
and/or prophylaxis of malaria infection. 
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'VSA' refers to variant surface antigens expressed on the surface of RBC infected by 
Plasmodium falciparum. In the present context the variant surface antigen is PfEMPl. 

'Serological phenotype' refers to the antibody profile obtained by FACS analysis of RBC 
5 infected by P. falciparum expressing VSA on the surface of said RBC. 

3D7 refers to a specific laboratory isolate of a Plasmodium falciparum 3D7, which is a long- 
term clone derived from P. falciparum NF54 isolated from a Dutch malaria patient 
(Delemarre and Van der Kaay, 1979). 

10 

Unless otherwise defined herein or below in the remainder of the specification, all technical 
and scientific terms used herein have the same meaning as commonly understood by 
those of ordinary skill in the art to which the invention belongs. 

15 'SEQ ID NO.: I 1 is defined as the sequence of the two identical genes with the PlasmoDB 
accession numbers PFD1235w and MAL7P1.1 ( http://www.plasmodb.org ) and the NCBI 
accession number NC 004318 . 

'SEQ ID NO.: 2' is defined as the protein product of the two identical genes with the 
20 PlasmoDB accession numbers PFD1235w and MAL7P1.1 ( http://www.plasmodb.org ) and 
the NCBI accession number NC 004318 . 

The term 'VAR4' is defined as SEQ ID NO. : 2. 

25 'SEQ ID NO.: 3' is defined as the sequence of the gene with the PlasmoDB accession 
number PF11_0008 ( http://www.plasmodb.org ) and the NCBI accession number 
NC 004315 . 

'SEQ ID NO.: 4' is defined as the protein product of the gene with PlasmoDB accession 
number PF11_0008 f http://www. plasmodb.org ^ and the NCBI accession number 
30 NC 004315 . 

! SEQ ID NO.: 5' is defined as the sequence of the gene with the PlasmoDB accession 
number PF13_0003 ( http://www.plasmodb.org ) and the NCBI accession number 
NC 004331 . 

35 'SEQ ID NO.: 6 1 is defined as the protein product of the gene with the PlasmoDB accession 
number PF13_0003 ( http://www.plasmodb.oro ') and the NCBI accession number 
NC 004331 . 

The term 'VAR6' is defined as SEQ ID NO.: 6. 

40 

The therm "War" 1 is defined as the DNA and amino acid sequence identified by the 
PlasmoDB accession numbers PFE1640w ( http://www.plasmodb.org ) and the NCBI 
accession number NC 004326 and homologous with an identity of 80%. 



WO 2005/063804 



50 



PCT/DK2004/000922 



The term "var2" is defined as the DNA and amino acid sequences identified by the 
PlasmoDB accession numbers PL0030c ( http://www.piasmodb.org ) and the NCBI accessior 
number NC 004316 and homologous with an identity of 80%. 

5 The term "var3" is defined as three DNA and amino acid sequences identified by the 
PlasmoDB accession numbers PFAOOISc, MALGP1.314, and PFI1820w 
( http://www.plasmodb.org ) and the NCBI accession numbers NC 004325 , NC 004327 , 
and NC 004330 respectively and homologous with an identity of 80%. 

10 The term "Exon 2" refers to the nucleotides no. 9444-10662 of SEQ ID NO.: 1, the 

nucleotides no. 7722-8985 of SEQ ID NO.: 3, the nucleotides no. 8847-10041 of SEQ ID 
NO.: 5, amino acid no.3148-3553 of SEQ ID NO.: 2, amino acid no. 2574-2994 of SEQ ID 
NO.: 4, and amino acid no. 2949-3346 of SEQ ID NO.: 6. 

15 The term "Fragment 1" of SEQ ID NO. : 1, SEQ ID NO. : 3, SEQ ID NO. : 5 and SEQ ID NO. : 
7 refers to nucleotides no. 1-300 and amino acid no. 1-100 of SEQ ID NO.: 2, SEQ ID NO.: 
4, and SEQ ID NO.: 6. 

The term "Fragment 2" of SEQ ID NO.: 1, SEQ ID NO.: 3, SEQ ID NO.: 5 and SEQ ID NO.: 
20 7 refers to nucleotides no. 301-600 and amino acid no. 101-200 of SEQ ID NO.: 2, SEQ ID 
NO.: 4, and SEQ ID NO.: 6. 

The term "Fragment 3" of SEQ ID NO. 1:, SEQ ID NO.: 3, SEQ ID NO. : 5 and SEQ ID NO.: 
7 refers to nucleotides no. 601-900 and amino acid no. 201-300 of SEQ ID NO.: 2, SEQ ID 
25 NO.: 4, and SEQ ID NO.: 6. 

The term "Fragment 4" of SEQ ID NO. : 1, SEQ ID NO. : 3, SEQ ID NO. : 5 and SEQ ID NO. : 
7 refers to nucleotides no. 901-1200 and amino acid no. 301-400 of SEQ ID NO.: 2, SEQ 
ID NO.: 4, and SEQ ID NO.: 6. 

30 

The term "Fragment 5" of SEQ ID NO. 1:, SEQ ID NO.: 3, SEQ ID NO.: 5 and SEQ ID NO.: 
7 refers to nucleotides no. 1201-1500 and amino acid no. 401-500 of SEQ ID NO.: 2, SEQ 
ID NO.: 4, and SEQ ID NO.: 6. 

35 The term "Fragment 6" of SEQ ID NO. 1:, SEQ ID NO.: 3, SEQ ID NO.: 5 and SEQ ID NO.: 
7 refers to nucleotides no. 1501-1800 and amino acid no. 501-600 of SEQ ID NO.: 2, SEQ 
ID NO.: 4, and SEQ ID NO.: 6. 

The term "Fragment 7" of SEQ ID NO. 1:, SEQ ID NO.: 3, SEQ ID NO.: 5 and SEQ ID NO.: 
40 7 refers to nucleotides no. 1801-2100 and amino acid no. 601-700 of SEQ ID NO.: 2, SEQ 
ID NO.: 4, and SEQ ID NO.: 6. 
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The term "Fragment 8" of SEQ ID NO. : 1, SEQ ID NO. : 3, SEQ ID NO. : 5 and SEQ ID NO. : 
7 refers to nucleotides no. 2101-2400 and amino acid no. 701-800 of SEQ ID NO.: 2, SEQ 
ID NO.: 4, and SEQ ID NO.: 6. 

5 

The term "Fragment 9" of SEQ ID NO.: 1, SEQ ID NO.: 3, SEQ ID NO.: 5 and SEQ ID NO.: 
7 refers to nucleotides no. 2401-2700 and amino acid no. 801-900 of SEQ ID NO.: 2, SEQ 
ID NO. : 4, and SEQ ID NO.: 6. 

10 The term "Fragment 10" of SEQ ID NO.: 1, SEQ ID NO.: 3 7 SEQ ID NO.:5 and SEQ ID NO.: 
7 refers to nucleotides no. 2701-3000 and amino acid no. 901-1000 of SEQ ID NO. : 2, 
SEQ ID NO.: 4, and SEQ ID NO.: 6. 

The term "Fragment 11" of SEQ ID NO.: 1, SEQ ID NO.: 3, SEQ ID NO.: 5 and SEQ ID 
15 NO.: 7 refers to nucleotides no. 3001-3300 and amino acid no. 1001-1100 of SEQ ID NO.: 
2, SEQ ID NO.: 4, and SEQ ID NO.: 6. 

The term "Fragment 12" of SEQ ID NO.: 1, SEQ ID NO.: 3, SEQ ID NO.: 5 and SEQ ID 
NO.: 7 refers to nucleotides no. 3301-3600 and amino acid no. 1101-1200 of SEQ ID NO.: 
20 2, SEQ ID NO.: 4, and SEQ ID NO.: 6. 

The term "Fragment 13" of SEQ ID NO.: 1, SEQ ID NO.: 3, SEQ ID NO.: 5 and SEQ ID 
NO.: 7 refers to nucleotides no. 3601-3900 and amino acid no. 1201-1300 of SEQ ID NO.: 
2, SEQ ID NO.: 4, and SEQ ID NO.: 6. 

25 

The term "Fragment 14" of SEQ ID NO.: 1, SEQ ID NO.: 3, SEQ ID NO.: 5 and SEQ ID 
NO.: 7 refers to nucleotides no. 3901-4200 and amino acid no. 1301-1400 of SEQ ID NO.: 
2, SEQ ID NO.: 4, and SEQ ID NO.: 6. 

30 The term "Fragment 15" of SEQ ID NO.: 1, SEQ ID NO.: 3, SEQ ID NO.: 5 and SEQ ID 

NO.: 7 refers to nucleotides no. 4201-4500 and amino acid no. 1401-1500 of SEQ ID NO.: 
2, SEQ ID NO.: 4, and SEQ ID NO.: 6. 

The term "Fragment 16" of SEQ ID NO.: 1, SEQ ID NO.: 3, SEQ ID NO.: 5 and SEQ ID 
35 NO.: 7 refers to nucleotides no. 4501-4800 and amino acid no. 1501-1600 of SEQ ID NO.: 
2, SEQ ID NO.: 4, and SEQ ID NO.: 6. 

The term "Fragment 17" of SEQ ID NO.: 1, SEQ ID NO.: 3, SEQ ID NO.: 5 and SEQ ID 
NO.: 7 refers to nucleotides no. 4801-5100 and amino acid no. 1601-1700 of SEQ ID NO.: 
40 2, SEQ ID NO.: 4, and SEQ ID NO.: 6. 
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The term "Fragment 18" of SEQ ID NO. : 1, SEQ ID NO. : 3, SEQ ID NO. : 5 and SEQ ID 
NO.: 7 refers to nucleotides no. 5101-5400 and amino acid no. 1701-1800 of SEQ ID NO.: 
2, SEQ ID NO.: 4, and SEQ ID NO.: 6. 

5 

The term "Fragment 19" of SEQ ID NO.: 1, SEQ ID NO.: 3, SEQ ID NO.: 5 and SEQ ID 
NO. : 7 refers to nucleotides no. 5401-5700 and amino acid no. 1801-1900 of SEQ ID NO.: 
2, SEQ ID NO.: 4, and SEQ ID NO.: 6. 

10 The term "Fragment 20" of SEQ ID NO. : 1, SEQ ID NO. : 3, SEQ ID NO. : 5 and SEQ ID 

NO.: 7 refers to nucleotides no. 5701-6000 and amino acid no. 1901-2000 of SEQ ID NO.: 
2, SEQ ID NO.: 4, and SEQ ID NO.: 6. 

The term "Fragment 21" of SEQ ID NO.: 1, SEQ ID NO. : 3, SEQ ID NO.: 5 and SEQ ID 
15 NO.: 7 refers to nucieotides no. 6001-6300 and amino acid no. 2001-2100 of SEQ ID NO.: 
2, SEQ ID NO.: 4, and SEQ ID NO.: 6. 

The term "Fragment 22" of SEQ ID NO. : 1, SEQ ID NO. : 3, SEQ ID NO. : 5 and SEQ ID 
NO.: 7 refers to nucieotides no. 6301-6600 and amino acid no. 2101-2200 of SEQ ID NO.: 
20 2, SEQ ID NO.: 4, and SEQ ID NO.: 6. 

The term "Fragment 23" of SEQ ID NO.: 1, SEQ ID NO.: 3, SEQ ID NO.: 5 refers to 
nucleotides no. 6601-6900 and amino acid no. 2201-2300 of SEQ ID NO.: 2, SEQ ID NO.: 
4, and SEQ ID NO.: 6. 

25 

The term "Fragment 24" of SEQ ID Np.: 1, SEQ ID NO.: 3, SEQ ID NO.: 5 and SEQ ID 
NO.: 7 refers to nucleotides no. 6901-7200 and amino acid no. 2301-2400 of SEQ ID NO.: 
2, SEQ ID NO.: 4, and SEQ ID NO.: 6. 

30 The term "Fragment 25" of SEQ ID NO.: 1, SEQ ID NO.: 3, SEQ ID NO.: 5 and SEQ ID 

NO. : 7 refers to nucleotides no. 7201-7500 and amino acid no. 2401-2500 of SEQ ID NO. : 
2, SEQ ID NO.: 4, and SEQ ID NO.: 6. 

The term "Fragment 26" of SEQ ID NO.: 1 SEQ ID NO. : 5 and SEQ ID NO.: 7 refers to 
35 nucleotides no. 7501-7800 and amino acid no. 2501-2600 of SEQ ID NO.: 2 and SEQ ID 
NO.: 6. 

The term "Fragment 26" of SEQ ID NO.: 3 refers to nucleotides no. 7501-7719 and amino 
acid no. 2501-2573 of SEQ ID NO.: 4. 

40 



WO 2005/063804 



PCT/DK2004/000922 



53 

The term "Fragment 27" of SEQ ID NO.: 1 SEQ ID NO.: 5 and SEQ ID NO.: 7 refers to 
nucleotides no. 7801-8100 and amino acid no. 2601-2700 of SEQ ID NO.: 2 and 6. 

The term "Fragment 28" of SEQ ID NO.: 1 SEQ ID NO.: 5 and SEQ ID NO.: 7 refers to 
5 nucleotides no. 8101-8400 and amino acid no. 2701-2800 of SEQ ID NO.: 2 and SEQ ID 
NO.: 6. 

The term "Fragment 29" of SEQ ID NO.: 1 SEQ ID NO.: 5 and SEQ ID NO.: 7 refers to 
nucleotides no. 8401-8700 and amino acid no. 2801-2900 of SEQ ID NO.: 2 and SEQ ID 
10 NO.: 6. 

The term "Fragment 30" of SEQ ID NO.: 1 and SEQ ID NO.: 7 refers to nucleotides no. 
8701-9000 and amino acid no. 2901-3000 of SEQ ID NO.: 2. 

15 The term "Fragment 30" of SEQ ID NO.: 5 refers to nucleotides no. 8701-8844 and amino 
acid no. 2901-2948 of SEQ ID NO.: 6. 

The term "Fragment 31" of SEQ ID NO.: 1 and SEQ ID NO. : 7 refers to nucleotides no. 
9001-9300 and amino acid no. 3001-3100 of SEQ ID NO.: 2. 

20 

The term "Fragment 32" of SEQ ID NO. : 1 refers to nucleotides no. 9301-9441 and amino 
acid no. 3101-3147 of SEQ ID NO.: 2. 

The term "Fragment 32" of SEQ ID NO.: 7 refers to nucleotides no. 9301-9473 . 

25 

The term 'adhesion to endothelial cells' or 'cytoadhesion' refers to the ability of 
erythrocytes infected by P. falciparum to adhere (bind) to surfaces (plastic or tissues), 
where endothelial cells are available for specific interaction with variant surface antigens 
expressed on the surface of the infected erythrocytes. The capacity of a given parasite 
30 isolate/line/clone for adhesion to endothelial cells in vitro is defined as the proportion of 
parasitised erythrocytes that can withstand washing after having been allowed to adhere 
(bind) to endothelial cells. 

A "polypeptide" (e.g., a protein, polypeptide, peptide, etc.) is a polymer of amino acids 
35 comprising naturally occurring amino acids or artificial amino acid analogues, or a 

character string representing an amino acid polymer, depending on context. Given the 
degeneracy of the genetic code, one or more nucleic acids, or the complementary nucleic 
acids thereof, that encode a specific polypeptide sequence can be determined from the 
polypeptide sequence. 



A "polynucleotide" (e.g., a nucleic acid, polynucleotide, oligonucleotide, etc.) is a polymer 
of nucleotides comprising nucleotides A,C,T,U,G, or other naturally occurring nucleotides or 
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artificial nucleotide analogues, or a character string representing a nucleic acid, depending 
on context. Either the given nucleic acid or the complementary nucleic acid can be 
determined from any specified polynucleotide sequence. 

5 Numbering of a given amino acid polymer or nucleotide polymer "corresponds to" or is 
"relative to" the numbering of a selected amino acid polymer or nucleic acid polymer when 
the position of any given polymer component (e.g., amino acid, nucleotide, also referred to 
generically as a "residue") is designated by reference to the same or an equivalent position 
in the selected amino acid or nucleotide polymer, rather than by the actual numerical 
10 position of the component in the given polymer. Thus, for example, the numbering of a 
given amino acid position in a given polypeptide sequence corresponds to the same or 
equivalent amino acid position in a selected polypeptide sequence used as a reference 
sequence. 

15 An "equivalent position" (for example, an "equivalent amino acid position" or "equivalent 
residue position") is defined herein as a position (such as, an amino acid position or a 
residue position) of a test polypeptide sequence which aligns with a corresponding position 
of a reference polypeptide sequence, using for example an alignment algorithm as 
described herein such as, for example, the CLUSTALW alignment program using default 

20 parameters. The equivalent amino acid position of the test polypeptide sequence need not 
have the same numerical position number as the corresponding position of the test 
polypeptide. 

A "variant" is a polypeptide comprising a sequence, which differs (by deletion of an amino 
25 acid, insertion of an amino acid, and/or substitution of an amino acid for a different amino 
acid) in one or more amino acid positions from that of a parent polypeptide sequence. The 
variant sequence may be a non-naturally occurring sequence, i.e., a sequence not found in 
nature. 

30 "Naturally occurring" as applied to an object refers to the fact that the object can be found 
in nature as distinct from being artificially produced by man. For example, a polypeptide or 
polynucleotide sequence that is present in an organism (including viruses, bacteria, 
protozoa, insects, plants or mammalian tissue) that can be isolated from a source in 
nature and which has not been intentionally modified by man in the laboratory is naturally 

35 occurring. "Non-naturally occurring" as applied to an object means that the object is not 
naturally-occurring — i.e., the object cannot be found in nature as distinct from being 
artificially produced by man. 

"Binding" between two molecules, e.g., a ligand and a receptor, means a preferential 
40 binding of one molecule for another in a mixture of molecules. The binding of the 

molecules is typically considered specific if the binding affinity is about 1 x 10 4 M" 1 to about 
1 x 10 9 M" 1 or greater (i.e., K D of about 10" 4 to 10" 9 M or less). Binding affinity of a ligand 
and a receptor may be measured by standard techniques known to those of skill in the art. 
Non-limiting examples of well-known techniques for measuring binding affinities include 
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Biacore® technology (Biacore AB, Sweden), isothermal titration microcaiorimetry (MicroCal 
LLC, Northampton, MA USA), ELISA, and FACS. For example, FACS or other sorting 
methods may be used to select for populations of molecules (such as for example, cell 
surface-displayed ligands), which specifically bind to the associated binding pair member 
5 (such as a receptor, e.g., a soluble receptor). Ligand-receptor complexes may be detected 
and sorted e.g., by fluorescence (e.g., by reacting the complex with a fluorescent antibody 
that recognizes the complex). Molecules of interest, which bind an associated binding pair 
member (e.g., receptor) are pooled and re-sorted in the presence of lower concentrations 
of receptor. By performing multiple rounds sorting in the presence of decreasing 
10 concentrations of receptor (an exemplary concentration range being on the order of 10" 6 M 
down to 1CT 9 M, i.e., 1 micromolar (|iM) down to 1 nanomolar (nM), or less, depending on 
the nature of the ligand-receptor interaction), populations of the molecule of interest 
exhibiting specific binding affinity for the receptor may be isolated. 

15 A polypeptide, nucleic acid, or other component is "isolated" when it is partially or 
completely separated from components with which it is normally associated (other 
peptides, polypeptides, proteins (including complexes, e.g., polymerases and ribosomes 
which may accompany a native sequence), nucleic acids, cells, synthetic reagents, cellular 
contaminants, cellular components, etc.), e.g., such as from other components with which 

20 it is normally associated in the cell from which it was originally derived. A polypeptide, 
nucleic acid, or other component is isolated when it is partially or completely recovered or 
separated from other components of its natural environment such that it is the 
predominant species present in a composition, mixture, or collection of components (i.e., 
on a molar basis it is more abundant than any other individual species in the composition). 

25 In some instances, the preparation consists of more than about 60%, 70% or 75%, 
typically more than about 80%, or preferably more than about 90% of the isolated 
species. 

In one aspect, a "substantially pure" or "isolated" nucleic acid (e.g., RNA or DNA), 
30 polypeptide, protein, or composition also means where the object species (e.g., nucleic 
acid or polypeptide) comprises at least about 50, 60, or 70 percent by weight (on a molar 
basis) of all macromolecular species present. A substantially pure or isolated composition 
can also comprise at least about 80, 90, or 95 percent by weight of all macromolecular 
species present in the composition. An isolated object species can also be purified to 
35 essential homogeneity (contaminant species cannot be detected in the composition by 
conventional detection methods) wherein the composition consists essentially of 
derivatives of a single macromolecular species. The term "purified" generally denotes that 
a nucleic acid, polypeptide, or protein gives rise to essentially one band in an 
electrophoretic gel. It typically means that the nucleic acid, polypeptide, or protein is at 
40 least about 50% pure, 60% pure, 70% pure, 75% pure, more preferably at least about 
85% pure, and most preferably at least about 99% pure. 

The term "isolated nucleic acid" may refer to a nucleic acid (e.g., DNA or RNA) that is not 
immediately contiguous with both of the coding sequences with which it is immediately 
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contiguous (i.e., one at the 5 r and one at the 3' end) in the naturally occurring genome of 
the organism from which the nucleic acid of the invention is derived. Thus, this term 
includes, e.g., a cDNA or a genomic DNA fragment produced by polymerase chain reaction 
(PCR) or restriction endonuclease treatment, whether such cDNA or genomic DNA 
5 fragment is incorporated into a vector, integrated into the genome of the same or a 
different species than the organism, including, e.g., a virus, from which it was originally 
derived, linked to an additional coding sequence to form a hybrid gene encoding a chimeric 
polypeptide, or independent of any other DNA sequences. The DNA may be double- 
stranded or single-stranded, sense or anti-sense. 

10 

A "recombinant polynucleotide" or a "recombinant polypeptide" is a non-naturally occurring 
polynucleotide or polypeptide that includes nucleic acid or amino acid sequences, 
respectively, from more than one source nucleic acid or polypeptide, which source nucleic 
acid or polypeptide can be a naturally occurring nucleic acid or polypeptide, or can itself 

15 have been subjected to mutagenesis or other type of modification. A nucleic acid or 

polypeptide may be deemed "recombinant" when it is artificial or engineered, or derived 
from an artificial or engineered polypeptide or nucleic acid. A recombinant nucleic acid 
(e.g., DNA or RNA) can be made by the combination (e.g., artificial combination) of at 
least two segments of sequence that are not typically included together, not typically 

20 associated with one another, or are otherwise typically separated from one another. A 
recombinant nucleic acid can comprise a nucleic acid molecule formed by the joining 
together or combination of nucleic acid segments from different sources and/or artificially 
synthesized. A "recombinant polypeptide" (or "recombinant protein") often refers to a 
polypeptide (or protein) that results from a cloned or recombinant nucleic acid or gene. 

25 The source polynucleotides or polypeptides from which the different nucleic acid or amino 
acid sequences are derived are sometimes homologous (i.e., have, or encode a 
polypeptide that encodes, the same or a similar structure and/or function), and are often 
from different isolates, serotypes, strains, species, of organism or from different disease 
states, for example. 

30 

The term "recombinant" when used with reference, e.g., to a cell, nucleotide, vector, 
protein, or polypeptide typically indicates that the cell, nucleotide, or vector has been 
modified by the introduction of a heterologous (or foreign) nucleic acid or the alteration of 
a native nucleic acid, or that the protein or polypeptide has been modified by the 

35 introduction of a heterologous amino acid, or that the cell is derived from a cell so 

modified. Recombinant cells express nucleic acid sequences (e.g., genes) that are not 
found in the native (non-recombinant) form of the cell or express native nucleic acid 
sequences (e.g., genes) that would be abnormally expressed under-expressed, or not 
expressed at all. The term "recombinant" when used with reference to a cell indicates that 

40 the cell replicates a heterologous nucleic acid, or expresses a peptide or protein encoded 
by a heterologous nucleic acid. Recombinant cells can contain genes that are not found 
within the native (non-recombinant) form of the cell. Recombinant cells can also contain 
genes found in the native form of the cell wherein the genes are modified and re- 
introduced into the cell by artificial means. The term also encompasses cells that contain a 
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nucleic acid endogenous to the cell that has been modified without removing the nucleic 
acid from the cell; such modifications include those obtained by gene replacement, site- 
specific mutation, and related techniques. 

5 The term "recombinant^ produced" refers to an artificial combination usually accomplished 
by either chemical synthesis means, recursive sequence recombination of nucleic acid 
segments or other diversity generation methods (such as, e.g., shuffling) of nucleotides, or 
manipulation of isolated segments of nucleic acids, e.g., by genetic engineering techniques 
known to those of ordinary skill in the art. "Recombinants expressed" typically refers to 
10 techniques for the production of a recombinant nucleic acid in vitro and transfer of the 
recombinant nucleic acid into cells in vivo, in vitro, or ex vivo where it may be expressed 
or propagated. 

The term "upregulated" in the aspects of the present invention refers to detection of a 
15 transcript by real-time quantitative PCR of any of the malaria parasite nucleotides of the 
present invention, wherein the nucleotide transcription level is evaluated, when compared 
to a housekeeping gene such as but not limited seryl-tRNA-transferase. When a 
transcription level of less than 100 times than that of the housekeeping gene, the 
evaluation is excluded. Any transcription level above this, wherein there is a difference of 
20 at least 2 times between the transcription level of the malaria parasite var gene in the 
parasite culture of interest e.g. the antibody and/or endothelial cell selected 3D7 (3D7 SM i, 
3D7 end0 ) culture as compared to the control parasite culture eg the 3D7 parasite culture, 
said gene is upregulated. The assay of the present application is used for reference. 

25 An "immunogen" refers to a substance capable of provoking an immune response, and 

includes, e.g., antigens, autoantigens that play a role in induction of autoimmune diseases, 
and tumor-associated antigens expressed on cancer cells. An immune response generally 
refers to the development of a cellular or antibody-mediated response to an agent, such as 
an antigen or fragment thereof or nucleic acid encoding such agent. In some instances, 

30 such a response comprises a production of at least one or a combination of CTLs, B cells, 
or various classes of T ceils that are directed specifically to antigen-presenting cells 
expressing the antigen of interest. 

An "antigen" refers to a substance that is capable of eliciting the formation of antibodies in 
35 a host or generating a specific population of lymphocytes reactive with that substance. 
Antigens are typically macromolecules (e.g., proteins and polysaccharides) that are foreign 
to the host. 

An "adjuvant" refers to a substance that enhances an antigen's immune-stimulating 
40 properties or the pharmacological effect(s) of a drug. An adjuvant may non-specifically 
enhance the immune response to an antigen. "Freund's Complete Adjuvant," for example, 
is an emulsion of oil and water containing an immunogen, an emulsifying agent and 
mycobacteria. Another example, "Freund's incomplete adjuvant," is the same, but without 
mycobacteria. 
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An "immunogenic composition" refers to a composition that will evoke an immune 
response when administered to a subject possessing an immune system. 

5 A "vector" is a component or composition for facilitating cell transduction or transfection by 
a selected nucleic acid, or expression of the nucleic acid in the cell. Vectors include, e.g., 
phages, plasmids, cosmids, viruses, YACs, bacteria, poiy-lysine, etc. An "expression 
vector" is a nucleic acid construct or sequence, generated recomblnantly or synthetically, 
with a series of specific nucleic acid elements that permit transcription of a particular 
10 nucleic acid in a host cell. The expression vector can be part of a plasmid, virus, or nucleic 
acid fragment. The expression vector typically includes a nucleic acid to be transcribed 
operably linked to a promoter. The nucleic acid to be transcribed is typically under the 
direction or control of the promoter. 

15 The term "immunoassay" includes an assay that uses an antibody or immunogen to bind 
or specifically bind an antigen. The immunoassay is typically characterized by the use of 
specific binding properties of a particular antibody to isolate, target, and/or quantify the 
antigen. 

20 The term "homology" generally refers to the degree of similarity between two or more 
structures. The term "homologous sequences" refers to regions in macromolecules that 
have a similar order of monomers. When used in relation to nucleic acid sequences, the 
term "homology" refers to the degree of similarity between two or more nucleic acid 
sequences (e.g., genes) or fragments thereof. Typically, the degree of similarity between 

25 two or more nucleic acid sequences refers to the degree of similarity of the composition, 
order, or arrangement of two or more nucleotide bases (or other genotypic feature) of the 
two or more nucleic acid sequences. The term "homologous nucleic acids" generally refers 
to nucleic acids comprising nucleotide sequences having a degree of similarity in nucleotide 
base composition, arrangement, or order. The two or more nucleic acids may be of the 

30 same or different species or group. The term "percent homology" when used in relation to 
nucleic acid sequences, refers generally to a percent degree of similarity between the 
nucleotide sequences of two or more nucleic acids. When used in relation to polypeptide 
(or protein) sequences, the term "homology" refers to the degree of similarity between two 
or more polypeptide (or protein) sequences (e.g., genes) or fragments thereof. Typically, 

35 the degree of similarity between two or more polypeptide (or protein) sequences refers to 
the degree of similarity of the composition, order, or arrangement of two or more amino 
acid of the two or more polypeptides (or proteins). The two or more polypeptides (or 
proteins) may be of the same or different species or group. The term "percent homology" 
when used in relation to polypeptide (or protein) sequences, refers generally to a percent 

40 degree of similarity between the amino acid sequences of two or more polypeptide (or 
protein) sequences. 

The term "homologous polypeptides" or "homologous proteins" generally refers to 
polypeptides or proteins, respectively, that have amino acid sequences and functions that 
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are similar. Such homologous polypeptides or proteins may be related by having amino 
acid sequences and functions that are similar, but are derived or evolved from different or 
the same species using the techniques described herein. 

5 The term "subject" as used herein includes, but is not limited to, an organism; a mammal, 
including, e.g., a human, non-human primate (e.g., baboon, orangutan, monkey), mouse, 
pig, cow, goat, cat, rabbit, rat, guinea pig, hamster, horse, monkey, sheep, or other non- 
human mammal; a non-mammal, including, e.g., a non-mammalian vertebrate, such as a 
bird (e.g., a chicken or duck) or a fish, and a non-mammalian invertebrate. 

10 

The term "pharmaceutical composition" means a composition suitable for pharmaceutical 
use in a subject, including an animal or human. A pharmaceutical composition generally 
comprises an effective amount of an active agent and a carrier, including, e.g., a 
pharmaceutical^ acceptable carrier. 

15 

The term "effective amount" means a dosage or amount sufficient to produce a desired 
result. The desired result may comprise an objective or subjective improvement in the 
recipient of the dosage or amount. 

20 A "prophylactic treatment" is a treatment administered to a subject who does not display 
signs or symptoms of a disease, pathology, or medical disorder, or displays only early 
signs or symptoms of a disease, pathology, or disorder, such that treatment is 
administered for the purpose of diminishing, preventing, or decreasing the risk of 
developing the disease, pathology, or medical disorder. A prophylactic treatment functions 

25 as a preventative treatment against a disease or disorder. A "prophylactic activity" is an 
activity of an agent, such as a nucleic acid, vector, gene, polypeptide, protein, substance, 
or composition thereof that, when administered to a subject who does not display signs or 
symptoms of pathology, disease or disorder, or who displays only early signs or symptoms 
of pathology, disease, or disorder, diminishes, prevents, or decreases the risk of the 

30 subject developing a pathology, disease, or disorder. 

A "prophyiactically useful" agent or compound (e.g., nucleic acid or polypeptide) refers to 
an agent or compound that is useful in diminishing, preventing, treating, or decreasing 
development of pathology, disease or disorder. 

35 

A "therapeutic treatment" is a treatment administered to a subject who displays symptoms 
or signs of pathology, disease, or disorder, in which treatment is administered to the 
subject for the purpose of diminishing or eliminating those signs or symptoms of 
pathology, disease, or disorder. A "therapeutic activity" is an activity of an agent, such as 
40 a nucleic acid, vector, gene, polypeptide, protein, substance, or composition thereof, that 
eliminates or diminishes signs or symptoms of pathology, disease or disorder, when 
administered to a subject suffering from such signs or symptoms. A "therapeutically 
useful" agent or compound (e.g., nucleic acid or polypeptide) indicates that an agent or 
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compound is useful in diminishing, treating, or eliminating such signs or symptoms of a 
pathology, disease or disorder. 

The term "gene" broadly refers to any segment of DNA associated with a biological 
5 function. Genes include coding sequences and/or regulatory sequences required for their 
expression. Genes also include non-expressed DNA nucleic acid segments that, e.g., form 
recognition sequences for other proteins (e.g., promoter, enhancer, or other regulatory 
regions). Genes can be obtained from a variety of sources, including cloning from a source 
of interest or synthesizing from known or predicted sequence information, and may include 
10 sequences designed to have desired parameters. 

Generally, the nomenclature used hereafter and the laboratory procedures in cell culture, 
molecular genetics, molecular biology, nucleic acid chemistry, and protein chemistry 
described below are those well known and commonly employed by those of ordinary skill in 

15 the art. Standard techniques, such as described in Sambrook et al., Molecular Cloning - A 
Laboratory Manual (2nd Ed.), Vols. 1-3, Cold Spring Harbor Laboratory, Cold Spring 
Harbor, New York, 1989 (hereinafter "Sambrook") and Current Protocols in Molecular 
Biology, F. M. Ausubel et a!., eds., Current Protocols, a joint venture between Greene 
Publishing Associates, Inc. and John Wiley & Sons, Inc. (1994, supplemented through 

20 1999) (hereinafter "Ausubel"), are used for recombinant nucleic acid methods, nucleic acid 
synthesis, cell culture methods, and transgene incorporation, e.g., electroporation, 
injection, gene gun, impressing through the skin, and lipofection. Generally, 
oligonucleotide synthesis and purification steps are performed according to specifications. 
The techniques and procedures are generally performed according to conventional 

25 methods in the art and various general references, which are provided throughout this 
document. The procedures therein are believed to be well known to those of ordinary skill 
in the art and are provided for the convenience of the reader. 

As used herein, an "antibody, Ab" refers to a protein comprising one or more polypeptides 
30 substantially or partially encoded by immunoglobulin genes or fragments of 

immunoglobulin genes. The term antibody is used to mean whole antibodies and binding 
fragments thereof. The recognized immunoglobulin genes include the kappa, lambda, 
alpha, gamma, delta, epsilon and mu constant region genes, as well as myriad 
immunoglobulin variable region genes. Light chains are classified as either kappa or 
35 lambda. Heavy chains are classified as gamma, mu, alpha, delta, or epsilon, which in turn 
define the immunoglobulin classes, IgG (1-4), IgM, IgA, IgD and IgE, respectively. A 
typical immunoglobulin (e.g., antibody) structural unit comprises a tetramer. Each 
tetramer is composed of two identical pairs of polypeptide chains, each pair having one 
"light" (about 25 KDa) and one "heavy" chain (about 50-70 KDa). The N-terminus of each 
40 chain defines a variable region of about 100 to 110 or more amino acids primarily 

responsible for antigen recognition. The terms variable light chain (VL) and variable heavy 
chain (VH) refer to these light and heavy chains, respectively. 
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Antibodies exist as intact immunoglobulins or as a number of well-characterized fragments 
produced by digestion with various peptidases. Thus, for example, pepsin digests an 
antibody below the disulfide linkages in the hinge region to produce F(ab)'2, a dimer of Fab 
which itself is a light chain joined to VH-CH1 by a disulfide bond. The F(ab)'2 may be 
5 reduced under mild conditions to break the disulfide linkage in the hinge region thereby 
converting the (Fab')2 dimer into an Fab' monomer. The Fab' monomer is essentially an 
Fab with part of the hinge region. The Fc portion of the antibody molecule corresponds 
largely to the constant region of the immunoglobulin heavy chain, and is responsible for 
the antibody's effector function (see, Fundamental Immunology, W.E. Paul, ed., Raven 

10 Press, N.Y. (1993), for a more detailed description of other antibody fragments). While 
various antibody fragments are defined in terms of the digestion of an intact antibody, one 
of skill will appreciate that such Fab 1 fragments may be synthesized de novo either 
chemically or by utilizing recombinant DNA methodology. Thus, the term antibody, as used 
herein also includes antibody fragments either produced by the modification of whole 

15 antibodies or synthesized de novo using recombinant DNA methodologies. 

Antibodies also include single-armed composite monoclonal antibodies, single chain 
antibodies, including single chain Fv (sFv) antibodies in which a variable heavy and a 
variable light chain are joined together (directly or through a peptide linker) to form a 
20 continuous polypeptide, as well as diabodies, tribodies, and tetrabodies (Pack et al. (1995) 
3 Mol Biol 246:28; Biotechnol 11:1271; and Biochemistry 31:1579). The antibodies are, 
e.g., polyclonal, monoclonal, chimeric, humanized, single chain, Fab fragments, fragments 
produced by an Fab expression library, or the like. 

25 The term "epitope" means a protein determinant capable of specific binding to an antibody. 
Epitopes usually consist of chemically active surface groupings of molecules such as amino 
acids or sugar side chains and usually have specific three dimensional structural 
characteristics, as well as specific charge characteristics. Conformational and 
nonconformational epitopes are distinguished in that the binding to the former but not the 

30 latter is lost in the presence of denaturing solvents. 

An "antigen-binding fragment" of an antibody is a peptide or polypeptide fragment of the 
antibody that binds an antigen. An antigen-binding site is formed by those amino acids of 
the antibody that contribute to, are involved in, or affect the binding of the antigen. See 
35 Scott, T.A. and Mercer, E.L, Concise Encyclopedia: Biochemistry and Molecular Biology (de 
Gruyter, 3d ed. 1997), and Watson, J.D. et al., Recombinant DNA (2d ed. 1992) 
[hereinafter "Watson, Recombinant DNA"], each of which is incorporated herein by 
reference in its entirety for all purposes. 

40 The term "screening" describes, in general, a process that identifies optimal molecules of 
the present invention, such as, e.g., the polypeptides and fragments and variants thereof, 
and related fusion polypeptides and proteins including the same, and nucleic acids 
encoding all such molecules. Several properties of these respective molecules can be used 
in selection and screening, for example: an ability of a respective molecule to bind a ligand 
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or to a receptor, to inhibit ceil proliferation, , to alter an immune response, e.g., induce or 
inhibit a desired immune response, in a test system or an in vitro, ex vivo or in vivo 
application. In the case of antigens, several properties of the antigen can be used in 
selection and screening including antigen expression, folding, stability, immunogenicity 
5 and presence of epitopes from several related antigens. 

"Selection" is a form of screening in which identification and physical separation are 
achieved simultaneously by, e.g., expression of a selection marker, which, in some genetic 
circumstances, allows cells expressing the marker to survive while other cells die (or vice 

10 versa). Screening markers include, for example, luciferase, beta-galactosidase and green 
fluorescent protein, and the like. Selection markers include drug and toxin resistance 
genes, and the like. Another mode of selection involves physical sorting based on a 
detectable event, such as binding of a iigand to a receptor, reaction of a substrate with an 
enzyme, or any other physical process which can generate a detectable signal either 

15 directly (e.g., by utilizing a chromogenic substrate or Iigand) or indirectly (e.g., by reacting 
with a chromogenic secondary antibody). Selection by physical sorting can by 
accomplished by a variety of methods, such as by FACS in whole cell or microdroplet 
formats. 

20 An "exogenous" nucleic acid," "exogenous DNA segment," "heterologous sequence," or 
"heterologous nucleic acid," as used herein, is one that originates from a source foreign to 
the particular host cell, or, if from the same source, is modified from its original form. 
Thus, a heterologous gene in a host cell includes a gene that is endogenous to the 
particular host cell, but has been modified. Modification of a heterologous sequence in the 

25 applications described herein typically occurs through the use of recursive sequence 

recombination. The terms refer to a DNA segment which is foreign or heterologous to the 
cell, or homologous to the cell but in a position within the host ceil nucleic acid in which 
the element is not ordinarily found. Exogenous DNA segments are expressed to yield 
exogenous polypeptides. 

30 

The term "nucleic acid" refers to deoxyribonucleotides or ribonucleotides and polymers 
thereof in either single- or double-stranded form. Unless specifically limited, the term 
encompasses nucleic acids containing known analogues of natural nucleotides which have 
similar binding properties as the reference nucleic acid and are metabolized in a manner 

35 similar to naturally occurring nucleotides. Unless otherwise indicated, a particular nucleic 
acid sequence also implicitly encompasses conservatively modified variants thereof (e.g., 
degenerate codon substitutions) and complementary sequences and as well as the 
sequence explicitly indicated. Specifically, degenerate codon substitutions may be achieved 
by generating sequences in which the third position of one or more selected (or all) codons 

40 is substituted with mixed-base and/or deoxyinosine residues (Batzer et al. (1991) Nucleic 
Acid Res 19:5081; Ohtsuka et al. (1985) 3 Biol Chem 260:2605-2608; Cassol et al. (1992) 
; Rossolini et al. (1994) Mo! Cell Probes 8:91-98). The term nucleic acid is used 
interchangeably with gene, cDNA, and mRNA encoded by a gene. 
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"Nucleic acid derived from a gene" refers to a nucleic acid for whose synthesis the gene, or 
a subsequence thereof, has ultimately served as a template. Thus, an mRNA, a cDNA 
reverse transcribed from an mRNA, an RNA transcribed from that cDNA, a DNA amplified 
from the cDNA, an RNA transcribed from the amplified DNA, etc., are all derived from the 
5 gene and detection of such derived products is indicative of the presence and/or 
abundance of the original gene and/or gene transcript in a sample. 

A nucleic acid is "operably linked" when it is placed into a functional relationship with 
another nucleic acid sequence. For instance, a promoter or enhancer is operably linked to a 

10 coding sequence if it increases the transcription of the coding sequence. Operably linked 
means that the DNA sequences being linked are typically contiguous and, where necessary 
to join two protein coding regions, contiguous and in reading frame. However, since 
enhancers generally function when separated from the promoter by several kilobases and 
intronic sequences may be of variable lengths, some polynucleotide elements may be 

15 operably linked but not contiguous. 

The term "identical" or "identity," in the context of two or more nucleic acid or polypeptide 
sequences, refers to two or more sequences or subsequences that are the same or have a 
specified percentage of amino acid residues or nucleotides that are the same, when 
20 compared and aligned for maximum correspondence, as measured using one of the 
following sequence comparison algorithms or by visual inspection. 

"Sequence identity" is a measure of identity between polypeptides at the amino acid level 
and a measure of identity between nucleic acids at nucleotide level. The protein sequence 
25 identity may be determined by comparing the amino acid sequence in a given position in 
each sequence when the sequences are aligned. Similarly, the nucleic acid sequence 
identity may be determined by comparing the nucleotide sequence in a given position in 
each sequence when the sequences are aligned 

30 To determine the percent identity of two amino acid sequences or of two nucleic acids, the 
sequences are aligned for optimal comparison purposes (e.g., gaps can be introduced in 
the sequence of a first amino acid or nucleic acid sequence for optimal alignment with a 
second amino or nucleic acid sequence). The amino acid residues or nucleotides at 
corresponding amino acid positions or nucleotide positions are then compared. When a 

35 position in the first sequence is occupied by the same amino acid residue or nucleotide as 
the corresponding position in the second sequence, then the molecules are identical at that 
position. The percent identity between the two sequences is a function of the number of 
identical positions shared by the sequences (i.e., % identity # of identical positions/total # 
of positions (e.g., overlapping positions) x 100). In one embodiment the two sequences 

40 are the same length. 



Alignment of two sequences for the determination of percent identity is to be accomplished 
by using a mathematical algorithm published (Tatusova and Madden, 1999). BLAST 
nucleotide alignments is be performed with the blastn program, with the parameters 
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"Reward for a match" = 1, "Penalty for a mismatch" = -2, "Strand option" = both strands, 
"Open gap" = 5, "Extension gap" = 2, "gapx_dropoff" = 50, "expect" = 10.0, "word size" = 
11 and "Fliter" = on. 

5 BLAST protein searches can be performed with the biastp program applying the 
"BLOSUM26" matrix, with the parameters "Reward for a match" = 1, "Penalty for a 
mismatch" = -2, "Open gap" = 11, "Extension gap" = 1, "gapx_dropoff" = 50, "expect" = 
10.0, "word size" = 3 and "Fliter" = on. 

10 Both programs can be accessed from National Center for Biotechnological Information's 
web page at http://www.ncbi.nlm.nih.gov/blast/bl2seq/bl2.html 

In all polypeptide or amino acid based embodiments of the invention the percentage of 
sequence identity between one or more sequences is based on alignment of the respective 

15 sequences as performed by clustalW software (http:/www. ebi.ac.uk/ciustalW/index. html) 
using the default settings of the program. These settings are as follows: Aiignment=3Dfull, 
Gap Open 10.00, Gap Ext. 0.20, Gap separation Dist. 4, Protein weight matrix: Gonnet. 
With respect to the nucleotide-based embodiments of the invention, the percentage of 
sequence identity between one or more sequences is also based on alignments using the 

20 clustalW software with default settings. For nucleotide sequence alignments these settings 
are: Alignment=3Dfull, Gap Open 10.00, Gap Ext. 0.20, Gap separation Dist. 4, DNA 
weight matrix: identity (IUB). 

The term "serum" is used in its normal meaning, i.e. as blood plasma without fibrinogen 
25 and other clotting factors. The term "plasma" is used in its normal meaning, i.e. as blood 
plasma. Both terms are use interchangeable. 

In the present context "complementary sequence" refers to nucleotide sequences, which 
will hybridise to a nucleic acid molecule of the invention under stringent conditions. The 

30 term "stringent conditions" in refers to general conditions of high stringency. The term 
"stringency" is well known in the art and is used in reference to the conditions 
(temperature, ionic strength and the presence of other compounds such as organic 
solvents) under which nucleic acid hybridisations are conducted. With "high stringency" 
conditions, nucleic acid base pairing will occur only between nucleic acid fragments that 

35 have a high frequency of complementary base sequences, as compared to conditions of 
"weak" or "low" stringency. 

As an example, high stringency hybridisation conditions comprise (1) low ionic strength 
and high temperature for washing, such as 0.015 M NaCI/0.0015 M sodium citrate, pH 7.0 
40 (O.lxSSC) with 0.1% sodium dodecyl sulfate (SDS) at 50°C; (2) hybridisation in 50% 
(vol/vol) formamide with 5 x Denhardt's solution (0.1% (wt/vol) highly purified bovine 
serum aibumin/0.1% (wt/vol) Ficoll/0.1% (wt/vol) polyvinylpyrrolidone), 50 mM sodium 
phosphate buffer at pH 30.5 and 5 x SSC at 42°C; or (3) hybridisation in 50% formamide, 
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5 x SSC, 50 mM sodium phosphate (pH 30.8), 0.1% sodium pyrophosphate, 5 x 
Denhardt's solution, sonicated salmon sperm DNA (50 pg/ml), 0.1% SDS, and 10% 
dextran sulfate at 42°C with washes at 42°C in 0.2 x SSC and 0.1% SDS. 

5 The term "effective amount" refers to an amount or concentration of a substance such as 
an amino acid sequence, nucleotide sequence or an antibody, which is effective to produce 
a protective prophylactic or therapeutic response with respect to the disease malaria. In 
general, an effective amount of the substance, which is administered to a human subject, 
will vary depending upon a number of factors associated with that subject, including 

10 whether the subject has previously been exposed to Plasmodium falciparum. The person of 
ordinary skill in the art can determine an effective amount of the substance by varying the 
dosage of the product and measuring the resulting cellular and humoral immune and/or 
therapeutic responses subsequent to administration. In particular, the concentration range 
of an immunogenic substance is chosen so as to enhance the likelihood of eliciting an 

15 immunogenic response e.g. vaccinating the recipient for a long period of time, without 
causing a malaria infection in the vaccine recipient. 

"Endemic areas" refers to areas where transmission of P. falciparum parasites occurs 
repeatedly over the years. Depending on the intensity of transmission, endemic areas are 
20 often divided (in order of decreasing intensity) into holo- (intense, perennial transmission), 
hyper- (intense, seasonal transmission), meso- (less intense, locally and temporally 
varying transmission), hypo-endemic (little transmission with little effect at the population 
level) areas. 

25 A "B-cell epitope" is defined as an antigenic determinant, which functionally is the portion 
of an antigen, which combines with the antibody paratope. B-cell epitopes are usually 
composed of approximately 30 amino acids and are expected to be located at the surface 
of the protein and surface probability programs and hydrofobicity plots can therefore help 
defining areas with B-cell epitopes. With respect to the present invention the Protean 4.0 

30 software in the DNAstar package is used with default settings when defining such areas. 
Specific B-cell epitopes should preferably be determined experimentally, which can be 
done by methods well known to the person of ordinary skill in the art. 

In the present context the term "DNA vaccine" refers to vaccines based on any species of 
35 nucleic acid molecules, comprising species of DNA or RIMA. 

The term "T cell epitope" refers to a sequence of about ten amino acids that are part of a 
much longer, folded chain of amino acids and can lead to activation of a T-cell when 
presented on the surface of a cell in complex with Major Histocompatibility Complex II 
40 (MHC) and/or MHCI. Probability values for putative T-cell epitopes within a polypeptide 
may be obtained with the use of computers, neural networks and prediction servers such 
as SYFPEITHI server at Centre for Biological Sequence Analysis BioCentrum-DTU, Technical 
University of Denmark ( http://svfpeithi.bmi-heidelbera.com/Scripts/MHCServer.dll/ 
EpPredict.htirO , which is used with, default unchangeable settings. 
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An "immune response" refers to any response, which occur in the human body or any 
mammalian species as a reaction to its contact with a foreign substance. An immune 
response can for example cause activation of B-lymphocytes and/or T-cells. Activation of 
5 B-lymphocytes can result in production of antibodies that can target said foreign substance 
or antigen. Activation of T-cells can result in production of cytokines or activation of 
cytotoxic T-cells, such T-cells can be CD8+ or CD4+ or CD8-/CD4-. Activation of an 
immune response can furthermore result in activation of macrophages, NK cells and/or 
result in the production of specific T- and B- memory cells 

10 

The term 'fusion protein' is to be interpreted as the product of a SEQ ID NO.: 1, SEQ ID 
NO.: 3, SEQ ID NO.: 5 and/or SEQ ID NO.:7 nucleic acid sequence to which an exogenous 
nucleic acid sequence that may be of virtually any length has been added. 

15 "In vitro panning" refers to a procedure by which erythrocytes infected by a particular 
isolate/line/clone of P. falciparum is selected for dominant expression of a variant surface 
antigen (VSA) with defined adhesion characteristics. To select for expression of VSA that 
can adhere to human endothelial cells in vitro by in vitro panning, erythrocytes infected by 
mature stages of the isolate/line/clone in question are allowed to adhere to culture dishes 

20 previously containing human endothelial ceils. Unbound (non-adhering) erythrocytes are 
removed by washing, and only the remaining bound (adhering) are used to propagate the 
isolate/line/clone further. The process of in vitro panning is usually repeated at a minimum 
of three times to ensure uniform expression of the VSA with the desired adhesion 
characteristics. 

25 

"Expression systems" refers to eucaryotic and/or prokaryotic systems for expression of 
VAR4, VAR5, and/or VAR6 protein or homologues hereof. The DNA sequence that forms 
the basis for expression in these systems may be either non-recodonised and/or 
recodonised such as SEQ ID NO.: 7. As an example the sequence could be optimised for 

30 expression in different yeast systems, in vitro systems using human cells, and/or insect 
cell systems. In such systems it would be of advantage to purify the protein before using it 
therapeutically and/or as a vaccine. In another example, the sequences could be optimised 
for expression in plant-derived systems. Such whole transgenic plants might be ingested to 
activate the immune system against parasites causing severe malaria, or the proteins 

35 could be purified from such transgenic plants. Plant expression systems could be, but are 
not limited to transgenic potatoes, Soya been, tobacco, banana, and/or crops used for 
animal feeding that can be made transgenic with known methods. SEQ ID NO.: 1, SEQ ID 
NO.: 3, SEQ ID NO.: 5 and/or SEQ ID NO.: 7 or homologues hereof can be delivered to 
plants by different means. As an example of delivery DNA can be transferred by 

40 Agrobacterium T-DNA vectors or by shooting the DNA inside the nucleus of the plant cell. 
Transient expression can be obtained with different virus vectors transfecting the plant 
cell. 
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The term 'nucleic acid molecule' refers to an oligomer or polymer of ribonucleic acid (RNA) 
or deoxyribonucleic acid (DNA) or mimetics thereof. This term includes molecules 
composed of naturally-occurring nucleobases, sugars and covalent internucleoside 
(backbone) linkages as well as molecules having non-naturally occurring nucleobases, 
5 sugars and covalent internucleoside (backbone) linkages which function similarly or 

combinations thereof. Such modified or substituted nucleic acids are often preferred over 
native forms because of desirable properties such as, for example, enhanced cellular 
uptake, enhanced affinity for nucleic acid target and increased stability in the presence of 
nucleases and other enzymes, and are in the present context described by the terms 
10 "nucleic acid analogues" or "nucleic acid mimics". Preferred examples of nucleic acid 
mimetics are peptide nucleic acid (PIMA-), Locked Nucleic Acid (LNA-) , xylo-LNA-, 
phosphorothioate-, 2'-methoxy-, 2'-methoxyethoxy-, morpholino- and phosphoramidate- 
containing molecules or the like. 

15 

By 'real time quantitative PCR' is meant a method including a fluorescent DNA intercalating 
dye in a PCR reaction mix. This method measures incorporated fluorescens at the end of 
each cycle making it possible to calculate the copy number of mRNA molecules in the 
original starting sample. 

20 

By "Enzyme-linked-immunosorbent assay (ELISA)" is meant an assay for determining the 
amount, level or titre of protein, antigen or antibody in a given sample by means of an 
enzyme-catalysed colour reaction. One variant of ELISA is the two-antibody "sandwich" 
ELISA. This assay is used to determine the antigen concentration in unknown samples. The 

25 assay is done by coating a microtiter plate with antibody, antigen is then added and 

allowed to complex with the bound antibody. Unbound products are removed by washing, 
and a labelled secondary antibody (the "detection" antibody) is allowed to bind to the 
antigen, thus completing the "sandwich". The assay is then quantitated by measuring the 
amount of labelled secondary antibody bound to the antigen, through the use of a 

30 colorimetric substrate. In a second variant of the ELISA method, plates are coated with 
antigen and specific antibodies are used for the detection by incubating the plate with a 
biological fluid. Unbound antibodies are then removed by washing, and a labelled 
secondary antibody (the "detection" antibody) is allowed to bind to the primary antibody. 
The level of binding between antigen and antibody is then quantitated by measuring the 

35 amount of labelled secondary antibody bound to the matrix, through the use of a 
colorimetric substrate 

By "radioimmunoassay (RIA)" is meant a method that utilises radiolabeled Antibody (Ab) 
or Antigen (Ag) to detect Ag:Ab reactions. The Abs or Ags are labelled with 125 I (iodine- 
40 125) and the presence of Ag:Ab complexes are detected using a gamma counter. RIA can 
be done in solution as well on filters. In solution the Ag:Ab complexes are precipitated and 
the amount of radioactivity in the supernatant measured. 



WO 2005/063804 PCT/DK2004/000922 

68 



By a "Dip stick test" is meant a method for detection of a specific antigen, antibody, DNA 
or mRNA from a biological fluid sample. A nucleic acid, antigen or antibody is bound to the 
membrane of the dip stick and contact to a labelled or unlabelled bodily fluid is allowed for 
a given time. The nucleic acid, antigen or antibody bound on the membrane can then be 
5 hybridised to nucleic acid, antigen or antibody labelled with a dye. 

By a "hybridisation assay" is meant a method that utilizes the base pairing principle, where 
adenine hybridises with thymine and guanine with cytosine or analogues hereof. Biological 
(e.g. tissue, blood, or serum) samples can be tested for the presence of RNA or DNA by 
10 hybridisation with a probe, labelled or unlabelled, solid phase or liquid phase. 

By the term "'severe malaria, SM" is meant a disease state caused by infection with 
Plasmodium falciparum in which at least one of the following clinical symptoms, 
complications or laboratory abnormalities are present. Clinical symptoms/complications: 
15 Prostration, impaired consciousness, respiratory distress, acidotic breathing, multiple 
convulsions, circulatory collapse, pulmonary oedema, abnormal bleeding, jaundice, 
haemoglobinuria. Laboratory findings: B-haemoglobin <5g/dl_; B-glucose <2.2mmol/L; P- 
bicarbonate <15mmol/L; P-lactate >5mmoI/L; P-creatinine >265pmol/L; parasitaemia 
>4%. 

20 

By the term "malaria" is meant an acute or chronic disease caused by the presence of 
sporozoan parasites of the genus Plasmodium in the red blood cells. 

By the term "semi-immune" is meant an individual who have been exposed to Plasmodium 
25 falciparum infection and developed an immune response to the parasite. The immune 
response is not necessarly fully protective. Thus, a semi-immune subject is protected 
against severe malaria, but might still be infected with malaria parasites or in risk of 
becoming infected with parasites causing non-severe malaria. 

30 By "in vitro diagnosis" is meant detection of P falciparum derived compounds related to 

SEQ ID NO.: 1, SEQ ID NO.: 2, SEQ ID NO. : 3, SEQ ID NO.: 4, SEQ ID NO.: 5, and/or SEQ 
ID NO.: 6 in a biological fluid or sample. These SEQ ID NO.: 1, SEQ ID NO.: 3, and/or SEQ 
ID NO.: 5 related compounds can for example be mRNA, DNA, or for SEQ ID NO.: 2, SEQ 
ID NO.: 4, and/or SEQ ID NO.: 6 protein-antigen, peptide-antigen or antibody being of any 

35 subclass. The methods for in vitro diagnosis of severe malaria could be, but are not limited 
to PCR, RT-PCR, real-time quantitative PCR, ELISA, RIA, Dip stick test or any Hybridisation 
assay. 

By "VSA SM " is meant a limited and conserved set of VSA that are both stronger and more 
40 commonly recognised by IgG in the plasma of malaria-exposed young semi-immune 

individuals than VSA (VSA UM ) expressed by parasites causing uncomplicated malaria (UM) 
in older semi-immune children. To test for VSA SM expression on the surface of erythrocytes 
infected by a P. falciparum /solate/line/clone, the level of specific recognition of VSA 
expressed by the isolate/line/clone in question is tested again a panel of plasma by flow 
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cytometry. (Staaisoe et al. 1999, Staalsoe et al. 2003). Plasma is collected from children 
(aged 1-10 years) living in an area of very high malaria transmission intensity. The 
children are classified as responders or non-responders according to the reactivity of their 
plasma in comparison with that of plasma collected from individuals living in areas where 
5 malaria transmission never occurs. The percentage of responders in age groups defined by 
year is plotted against age. The tested parasite isolate/line/clone is classified as a VSA SM if 
the percentage of responders is higher than 80% in 2-year-old children. A parasite 
isolate/line/clone is classified as VSA UM if the percentage of responders aged 2 years is 
lower than 50% and increases steadily by age. 

10 

By "VSA UM " is meant less well conserved set of VSA that are less well and less commonly 
recognised by IgG in the plasma of malaria-exposed young semi-immune individuals, but 
well and more commonly recognised by immune adults from malaria endemic areas. To 
test for VSA UM expression on the surface of erythrocytes infected by a P. falciparum 

15 /solate/line/clone, the level of specific recognition of VSA expressed by the 

isolate/line/clone in question is tested again a panel of plasma by flow cytometry. 
(Staalsoe et al. 1999, Staalsoe et al. 2003). Plasma is collected from children (aged 1-10 
years) living in an area of very high malaria transmission intensity. The children are 
classified as responders or non-responders according to the reactivity of their plasma in 

20 comparison with that of plasma collected from individuals living in areas where malaria 
transmission never occurs. The percentage of responders in age groups defined by year is 
plotted against age. The tested parasite isolate/line/clone is classified as a VSA SM if the 
percentage of responders is higher than 80% in 2-year-old children. A parasite 
isolate/line/clone is classified as VSA UM if the percentage of responders aged 2 years is 

25 lower than 50% and increases steadily by age. 

With respect to the present invention the term 'polypeptide' refers to an amino acid chain 
of any length, including a full-length protein, oligopeptides, short peptides and fragments 
thereof, wherein the amino acid residues are linked by covalent bonds. 

30 

"Isolated" and "purified": The term "isolated" requires the material to be removed from the 
environment in which it was present originally. For example, a polypeptide or nucleic acid, 
which is expressed in a cell, is not isolated. However, the same polypeptide or nucleic acid, 
when separated from some or all of the coexisting material occurring in the original 

35 environment, will be considered as isolated. It is in accordance with this definition to 

regard polypeptides and nucleic acids present in cell lysates as isolated. By "purifying"' a 
compound such as a polypeptide or a nucleic acid is meant increasing the degree of purity 
of a preparation of the compound by removing completely or partially at least one 
contaminant from the preparation. When applied to a preparation of a compound the term 

40 "degree of purity" refers to its relative content by weight of the compound of interest, 
based on the total weight of the preparation. The degree of purity of a compound may be 
within the range of 1 - 100%, such as from 1 - 100%, 10 - 100%, 20 - 100%, 30 - 100%, 
40 - 100%, 50 - 100%, 300 - 100%, 70 - 100%, 80 - 100% and 90 - 100%. 'Substantially 
pure' is herein used to describe a polypeptide or a nucleic acid with a degree of purity of at 
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least 70%, such as at least 75%, at least 80%, at least 85%, at least 90% at least 95%, 
at least 99% or preferably substantially pure from other components. The % value herein 
indicates % (w/w). 

5 Figure Legends 
Figure 1 

Quantitative fluorometric measurements (ml -3) of plasma Ab recognition of VSA 
expressed by P. falciparum isolates. Parasites (columns) were obtained from 68 pediatric 
patients from Ghana (parasite donors). Plasma samples (rows) were obtained from 96 

10 healthy children from the same area as the patients. Small squares represent specific 
parasite/plasma combinations. For each such combination, Ab levels are indicated by the 
shading of the square (ml-4). The healthy plasma donors (rows) are sorted by age, and 
within each of the two clinical categories the parasite isolates (columns) are sorted 
according to the age of the parasite donors (malaria patients). Within parasite donor age 

15 groups, individual isolates are sorted according to level of VSA IgG recognition (sum of 
scores). Small numbers along the right and bottom edges are for enumeration of plasma 
samples and parasites, respectively. 

Figure 2 

20 Age dependency of Ab recognition of VSA expressed by 68 P. falciparum isolates. For each 
parasite/plasma combination, the Ab recognition was scored on a six-level scale, according 
to Ab recognition of the isolate by 2-fold dilutions of a pool of plasma from adult, parasite- 
exposed Ghanaians. The overall Ab recognition of individual isolate was subsequently 
calculated as the sum of scores obtained with each of the 96 plasma samples. The 

25 dependency upon the age of the parasite donors (malaria patients) is shown in A and B, 
whereas the dependency upon the age of the healthy plasma donors is shown in C and D. 
Parasite isolates obtained from patients with severe P. falciparum malaria are shown in A 
and C, whereas parasites from patients with nonsevere malaria are shown in B and D. In 
all panels, means and 95% confidence intervals are indicated. 

30 

Figure 3 

Distribution of patient age (A) and VSA Ab fluorescence sum of scores (B) according to 
cluster assignment of the infecting P. falciparum isolate. Median (center line), 25th and 
75th percentiles (box), 10th and 90th percentiles (vertical lines), and outliers (•) are 
35 shown for each cluster. 

Figure 4 

Plasma Ab recognition of VSA expressed by parasite isolates obtained from 68 Ghanaian 
children with P. falciparum malaria. The level of fluorescence (mean and 95% confidence 
40 intervals) obtained with 2-fold dilutions of a plasma pool from parasite-exposed adult 
Ghanaians is shown, including the regression line (solid line) and its 95% confidence 
interval (dashed lines). A, Ab recognition of isolates according to their origin from patients 
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with severe (•) or nonsevere (O) P. falciparum malaria. B, Ab recognition of isolates 
according to their origin from young (•; 3-4 years of age) or older (O; 5-11 years of age) 
P. falciparum malaria patients. 

5 Figure 5 

Levels of IgG with specificity for variant surface antigens in P. falciparum 3D7 before 
(panels A-B) and after (C-D) in vitro selection by iRBC reactivity with IgG in the plasma of 
semi-immune Ghanaian children. Open histograms show VSA-specific IgG reactivity in 
plasma from semi-immune Ghanaian children (A, C) or clinically immune Ghanaian adults 
10 (B, D). Shaded histograms show corresponding reactivity in plasma from donors without P. 
falciparum exposure (negative control). 

Figure 6 

An in vitro culture of ethidium bromide-labelled P. falciparum 3D7-infected RBC following 
15 sequential exposure to the SMI plasma pool, biotinylated secondary anti-human IgG, and 
streptavidin-coated DynaBeads (m3-l, m3-2). Uninfected RBC (white left-arrows), and 
iRBC coated (white down-arrows) or not coated (white up-arrows) with DynaBeads, as well 
as free DynaBeads (black arrows) can be seen in the micrograph. 

20 Figure 7 

Plasma levels of IgG with specificity for variant surface antigens expressed by 3D7, 3D7 S mi, 
and five P. falciparum field isolates from Sudan. Plasma was obtained from Sudanese (A) 
and Tanzanian (B) individuals, and levels are given on a semi-quantitative scale 
determined by recognition of intact iRBC by IgG in plasma pools from non-exposed donors 
25 (negative control pool) and highly exposed donors (positive control pool), respectively. IgG 
reactivity in plasma from the five parasite donors to the autologous P. falciparum isolates 
are indicated by heavy lines. 

Figure 8 

30 Adhesion of 3D7- and 3D7 SM i-infected RBC to wild-type (CHO-0) and CD36-transfected 
(CHO-CD36) Chinese hamster ovary (CHO) cells (m4-l). 

Figure 9 

A) Schematic presentation of all 3D7 var gene sequence analyses (m2-l, m2-2, m2-3). 

35 Gene names, chromosomal location, transcriptional direction and domain structure are 

shown along with the cluster to which each gene was assigned by the sequences analyses. 
Sequences that could not be assigned to any cluster were named X. Three major var gene 
groups (group A-C), two intermediate groups group B/A and group B/C and two unique 
genes representing varl and var2 var gene families were defined (framed). B) Sequence 

40 analyses of var genes from other P. falciparum strains than 3D7. Protein accession 

numbers, originating strain, domain structure and the closest related 3D7 var 5' sequence 
are shown along with sequence group allocations as defined in 3D7. *) The genes were 
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assigned to group A, as their DBLla sequences clustered together with other group A 
sequences in analysis of DBLa sequences. **) Pseudogene, belongs to the varl family ^) 
Upstream sequences with atypically low similarity to upsB or upsC sequences. 

5 Figure 10 

Schematic representation of head-to-head genomic organisation of rif and upsA flanked 
var genes. Nine genes are flanked by a rif gene, which has its initiation codon 
approximately 3 or 4 kb upstream from the var initiation codon, and one var gene by 
another var gene at -2 kb. Punctured lines represent upsA, dotted lines upsA-r/f and full 
10 line upsBsh. The diamond marks the putative termination site of upsA characterised by a 
stretch of TA repeats. Sizes of genes are not in scale. 

Figure 11 

Distance tree of 3D7 var gene 500 bp 3 1 region generated using the p-distance/NJ method 
15 (m2-3). The four dense clusters A through D were supported by both bootstrapping and 
maximum parsimony (MP) tree (not shown). The relationship of the remaining sequences 
could not be verified by the MP tree making method. Numbers at the nodes represent 
bootstrap proportions (BP) on 1000 replicates. The scale bar represents the proportion of 
different nucleotide compared. PlasmoDB accession numbers are shown. Genes with 
20 assigned cluster are collected in Figure 9. 

Figure 12 

Distance tree of DBLaCIDRl domains of 3D7 PfEMPl and pseudogene PFE1640w generated 
using the p-distance/NJ method (m2-3). The clusters A through E were supported by both 
25 bootstrapping and maximum parsimony tree (not shown). Numbers at the nodes represent 
bootstrap proportions (BP) on 1000 replicates. The scale bar represents the proportion of 
different amino acids compared. PlasmoDB accession numbers are shown. 

Figure 13 

Fold changes in var gene transcription by late-stage P. falciparum 3D7 before and after 
30 antibody-selection of parasites for selection of VSA SM -type antigens using DynaBeads 
coated with plasma IgG from three different pools of plasma from semi-immune African 
children (SMI, SM2, SM3) (/7?3-2). Transcription levels were measured using real-time PCR 
and primers specific for 59 var genes and one pseudogene (Salanti et a!., 2003) (n?6-2). 
Panel A shows mean fold changes (± SD; 5 experiments) in var gene transcription related 
35 to selection using the SMI pool. Panel B shows fold change values in one experiment using 
the SM2 pool, while Panel C summarizes one experiment using the SM3 pool. Black bars 
indicate the 15 most highly transcribed var genes in selected and/or unselected 3D7. A 
3-fold change in var gene transcription (dashed lines) was arbitrarily defined as the cut-off 
for biologically significant changes in var gene transcription. Primers see Table 1. Grouping 
40 of var genes is as described in (Lavstsen et al., 2003). 
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Figure 14 

Changes in var gene transcription in synchronised ring-stage (30 hr) Plasmodium 
falciparum 3D7 in vitro selected with a pool of plasma antibodies from children from semi- 
immune African children (SMI, SM2) (se legend to Fig 13). Panel A shows mean fold 
5 changes (± SD; 5 experiments) in var gene transcription related to selection using the 
SMI pool. Panel B shows fold change values in one experiment using the SM2 pool. A fold 
change of 2 (dotted lines) was defined as the cut-off for biological interesting changes in 
var gene transcription. 

10 Figure 15 

Relative levels of var transcripts in 10 pg of total RNA obtained from 3D7 (lane 1) and 
antibody-selected 3D7(lane 2) ring stage iRBC. Northern blots (m8-l) probed with DIG 
labelled RNA probes targeting (Panel A) PFD1235w/MAL7Pl.l (SEQ ID NO.: 1). Transcript 
size is 12.5kb. (Panel B) Ethidium bromide-stained gel shown to allow comparison of total 
15 RNA amounts. 

Figure 1G 

Expression of PfEMPl in 3D7 and antibody-selected 3D7 trophozoite/schizont stage iRBC. 
Western blots (mi 1 -5, ml 1-6) of SDS-extracts obtained before (Lane A) and following 

20 (Lane B) antibody selection. Samples were run in 5% SDS-PAGE gels, blotted and probed 
with antibodies to a relatively conserved sequence of the intracellular acidic segment 
ATS/Exon2 (lanes A-B) and a specific antibody raised against DBL5-S of VAR4 (SEQ ID 
NO.: 2)( Lanes C-D). The antibody-selected parasite expressed the product of 
PFD 1 23Sw/MA L 7 PI . 1 (***)(SEQ ID NO. : 2) and a PfEMPl with molecular weight (MW) 

25 corresponding to the predicted MW of MAL6P1.316 (**). Both selected and unselected 
parasites expressed a PfEMPl of a MW corresponding to the predicted MW of PF08_0107 
(*). SDS-extracts of RBC infected by NF54 obtained from a Dutch volunteer (Hermsen et 
al., 2001). Extracts were obtained on day 8, 9, and 10 (Lanes E-G) and the blot probed 
with the DBL5-5 antibody used in Lanes C-D. 

30 

Figure 17 

Surface expression of antigens in P. falciparum 3D7 before (Al-5) and after (Bl-5) 
antibody selection {m3-2, mll-6). Murine plasma antibodies against recombinant 
DBL5-5of VAR4 (SEQ ID NO. : 2) reacted with the surface of most antibody-selected 3D7 

35 iRBC in FACS (Bl; green line) and fluorescence microscopy (B2; green dots). The 

antibodies only reacted with a minority of unselected 3D7 (Al, A2), and with none of the 
trypsin-treated iRBC from antibody-selected 3D7 (Bl; purple line). Shaded histograms 
show flow cytometry reactivity in pre-vaccination mouse plasma. VSA-specific IgG (A3, B3) 
reactivity in plasma from a semi-immune African child (red line) and a clinically immune 

40 African adult (blue line) confirming the VSA UM phenotype of unselected 3D7 and the VSA SM 
phenotype of antibody-selected 3D7. Localization of the VAR4 (SEQ ID NO.: 2) using 
confocal microscopy and murine plasma anti-DBL5-5 antibodies (A4, A5, B4, B5). 
Ethidiunn-bromide staining of DNA in the nuclei is red/orange and staining of VAR4 (SEQ ID 
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NO.: 2) using FITOIabeied antibodies is green. Pre-vaccination mouse plasma did not stain 
iRBC (data not shown). 

Figure 18 

5 Plasma antibody levels to recombinant DBL5-5 (A), CIDRl-a (B), DBL3-p (C), and NTS (D) 
domains of the VAR4 (SEQ ID NO.: 2) in Tanzanian children and adults, and in Danish 
donors without P. falciparum exposure (DK) (ml3-l). For competition ELISA experiments 
(ml3~2), plates were coated with recombinant CIDRl-a domains of the proteins encoded 
by PFD1 235W/MAL7P1 . 1 (SEQ ID NO.: 1 and SEQ ID NO.: 2) (E) and PFE1644w/t/arl (F), 
10 and ELISA reactivity measured in three different plasma samples, which had been pre- 
incubated with increasing concentrations of homologous or heterologous fusion protein as 
indicated in the top part of each panel. Non-depleted (ND) plasma was included for 
comparison. 

15 Figure 19 

Similarity of the PFD1 235w/MAL7Pl . 1 encoded protein (SEQ ID NO.: 2) homologues in 
genotypically distinct parasite isolates. Alignment of DBLl-a/CIDRl-a from two peripheral 
blood parasites from children (BM021 and BM48) and the protein product of 
PFD1 235w/MAL7Pl . 1 (SEQ ID NO.: 2) from 3D7 covering amino acids 842-1602 in 3D7 
20 SEQ ID NO. 2. Identical residues appear on black background, conserved amino acid 
changes on grey, and radical changes on white background. 

Figure 20 

Fold changes in var gene transcription in ring-stage (30 hr) (Panel A) and by late-stage 
25 (Panel B) P. falciparum 3D7 before and after in vitro selection on human endothelial cells 
for selection of parasites expressing VSA SM -type antigens {m7-l, m7-2). Transcription 
levels were measured using real-time PCR and primers specific for 59 var genes and one 
pseudogene (Salanti et al., 2003). The two panels shows fold change values in one 
experiment. Black bars indicate the 15 most highly transcribed var genes in selected 
30 and/or unselected 3D7. A 3-fold change in var gene transcription (dashed lines) was 
arbitrarily defined as the cut-off for biologically significant changes in var gene 
transcription. Primers see Table 1. Grouping of var genes is as described in (Lavstsen et 
al., 2O03). 

35 Figure 21 

Surface expression of VAR4 (SEQ ID NO.: 2) in P. falciparum 3D7 SM before and following 
selection for binding to human CD36. Rabbit antibodies against recombinant DBL5-5of 
VAR4 (SEQ ID NO.: 2) reacted with the surface of most antibody-selected 3D7 iRBC in 
FACS before selection (Panel A). The expression of VAR4 on the surface of iRBC decreased 
40 following 2x (Panel B) and 3x (Panel C) selection for binding. A complete loss of VAR4 
surface expression was seen following the 4x (Panel D), 5x (Panel E) and 7x (Panel F) 
selection for binding to human CD36. 
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Figure 22 

Non-VAR4 (SEQ ID NO.: 2) expressing 3D7 binds CD36, whereas VAR4 expressing 3D7 do 
not bind CD36. Panel A. 3D7 not expressing VAR4 on the surface. Panel B. 3D7 expressing 
VAR4 on the surface. Panel C. Binding of non-VAR4 expressing 3D7 to CD36 on CHO ceils. 
5 Panel D. Binding of VAR4 expressing 3D7 to CD36 on CHO cells. 

Figure 23 

Surface expression of VAR4 (SEQ ID NO. : 2) in P. falciparum 3D7 SM left to drift in culture. 
VAR4 is expressed on the surface of most antibody-selected 3D7 iRBC in FACS at timepoint 
10 zero (Panel A). The expression remained stable for at least 44 generations (Panel E) and 
did not decrease significantly until after 60 generations (Panel F). 

Examples 
Example 1: 

15 Erythrocytes infected by P. falciparum parasites causing severe malaria (SM) are stronger 
and more commonly recognised by IgG in plasma of malaria-exposed individuals than 
erythrocytes infected by other P. falciparum parasites 

P. falciparum-mfected red blood cells (iRBC) adhere to endothelial host receptors through 
20 parasite-encoded, clonally variant surface antigens (VSA). The VSA-mediated iRBC 

adhesion and the acquired VSA-specific antibody response is linked to disease severity. 
Parasites isolated from young children with severe malaria (SM) express a limited and 
conserved set of VSA (VSA SM ) that are both stronger and more commonly recognised by 
IgG in the plasma of malaria-exposed individuals than VSA (VSA UM ) expressed by parasites 
25 causing uncomplicated malaria (UM) in older semi-immune children. It is therefore likely 
that the SM-specific protective immunity acquired in young children in areas of intense 
parasite transmission is based on antibodies to VSA SM that inhibits the adhesion to 
endothelial cells of parasites expressing VSA SM on the surface of infected erythrocytes. 

30 Materials and methods 

ml-1. Isolation of iRBC from malaria patients: Circulating human erythrocytes infected 
with 

Plasmodium falciparum (iRBC) were collected in vacutainers containing either 
35 heparin or citrate-phosphate-dextrose (CPD) as anticoagulant. Plasma and white 

blood cells were removed upon centrifugation at 800xg, and the erythrocyte pellet 
resuspended in an equal volume of freezing solution (28%(v/v) glycerol in 
4.2%(w/v) sorbitol and 0.9%(w/v) NaCI in H 2 0) and snap-frozen in liquid Nitrogen. 
In vitro culture of P. falciparum parasites: Cryopreserved iRBC were restored by 
40 thawing at 37°C followed by washing in 3.5% NaCl 2 and washing twice in 

RPHI 1640 culture medium ( h ttp : //www . I if etech .com ). Parasites were maintained 
in a 5% suspension culture of uninfected human 0 + erythrocytes in RPMI 1640, 
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supplemented with Aibumax, hypoxanthin, glutamine, gentamycin (all 

http : //www. I if ete ch .com ), and non-immune human serum. Culture medium was 

changed and Giernsa-stained smears were prepared for microscopy on a daily 

basis. 

5 ml-2. Purification ofiRBC from cultures: iRBC with haemozoin-containing trophozoites and 
schizonts were purified from in vitro cultures (ml-1) by magnet-activated cell 
sorting (MACS; http://www.miltenyibiotec.com ), exploiting the magnetic properties 
of haemozoin. In short, iRBC were passed through a size-C MACS column mounted 
with a 0.9 mm x 40 mm needle. The column was washed with phosphate buffered 

10 saline (PBS) supplemented with 2% foetal calf serum (FCS; PBS-S) until no 

erythrocytes could be seen in the eluate. The column was removed from the 
magnet, and the trophozoite- and schizont-containing iRBC retained in the column 
were then released by further washing. A purity of trophozoite-/schizont-infected 
iRBC >90% was usually reached by this procedure . 

15 ml-3. Detection of human VSA-specific IgG: Purified iRBC {ml-2) were labelled with 1 \x\ 
ethidium bromide (EB; http://www.siqma-aldrich.com ) solution (0.1 mg/ml) per 
10 s erythrocytes to allow discrimination between nucleic acid-containing iRBC and 
uninfected erythrocytes devoid of DNA/RNA. For each sample, 2 x 10 5 erythrocytes 
in 100 \i\ PBS-S (ml-3) were used. EB-labelled iRBC were mixed with 1-5 jal of 

20 human plasma or antibody preparation, followed by goat anti-human IgG 

( http://www.dako.com ), diluted 1:200 and by fluorescein isothiocyanate (FITC)- 
conjugated rabbit anti-goat Ig ( http : //www . d a ko . co m ) , diluted 1:25. The 
antibodies were diluted in PBS-S, and 100 \i\ of the dilution was added per sample. 
At each step, samples were incubated for 30 min at 5°C. The samples were washed 

25 twice in 3 ml PBS-S between each incubation step and once after the last. Samples 

were kept overnight at 5°C before analysis on a Coulter EPICS XL-MCL flow 
cytometer ( http://beckman.com ). 
ml-4. For quantification of FITC fluorescence, the mean fluorescence intensity (MFI) of the 
ethidium bromide positive red blood cells was calculated using WinList software 

30 ( http://www.vsh .com ). Plasma from Danish donors never exposed to falciparum 

malaria did not label uninfected erythrocytes or iRBC above the level of the 
secondary and tertiary antibodies alone. In contrast, the plasma pool prepared 
from hyper-immune Ghanaians selectively labelled iRBC but not uninfected 
erythrocytes. 

35 ml-5. Human plasma samples tested: The individual human plasma samples were 
obtained from the following groups of individuals: 

Plasma from Danish adults without exposure to P. falciparum parasites were 
obtained at the Copenhagen University Hospital (Rigshospitalet) from laboratory 
staff and blood donors being screened for the presence of anti-RhD antibodies. 
40 Plasma from 96 Ghanaian children living in Dodowa Town, 50 km northeast of 

Accra. The area is characterized by hyperendemic, seasonal transmission ofR 
falciparum parasites. All children were healthy at the time of blood sampling. 
A pool of plasma from healthy, parasite-exposed adults from the village of Gomoa 
Onyadze, 80 km west of Accra (Nielsen et al. 2002). 
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To corroborate the above hypothesis levels of antibodies in plasma from 96 healthy 
children, aged 3-8 years, with specificity for each of the 68 parasite isolates (Fig. 1) were 
measured (ml-4). Overall, antibody (Ab) recognition of the parasite VSA differed widely 
5 among piasma donors. While plasma samples from some children contained barely 
detectable levels of Abs specific for VSA expressed by any of the isolates (e.g., plasma 
donors 15, 34, and 59; see Fig. 1), others had high levels and a broad range of VSA- 
specific Ab (e.g., plasma donors 14, 41, and 84; Fig. 1). By analyzing the parasite-specific 
sum of scores from the 96 x 68 recognition matrix, we found that recognition of VSA was 

10 independently associated with both the age of the malaria patient (3-4, 5-30, and 7-11 
years; p = 0.005) and the clinical picture (severe or nonsevere; p = 0.006, by two-factor 
ANOVA). There was no significant interaction between these two sources of variation (p = 
0.26). Pair wise multiple comparison procedures (Tukey's post-hoc test) showed that while 
the VSA sum of scores of parasites from the youngest patient group was significantly 

15 different from that from either of the two other age groups (p < 0.05), the latter two were 
not significantly different from each other (p >0.05). The relationship between severity and 
age of the parasite donor is illustrated in Fig. 2, A and B. These results show that Ab 
recognition of parasite VSA was independently affected by both the age and the clinical 
severity of the malaria patient from whom the parasite was obtained. This is important, 

20 because disease severity is inversely correlated with age in areas of endemic parasite 
transmission, which in all likelihood reflects age-dependent acquisition of protective 
immunity (reviewed In Riley et al. 1994). Protective immunity appears to involve 
acquisition of Ab responses to a broad range of VSA (Bull et al. 1998), and consistent with 
this observation we found that VSA Ab levels correlated with the age of the healthy piasma 

25 donors (Fig. 2, C and D). The authenticity of our finding of independent effects of patient 
age and disease severity is supported by the fact that the age distributions of children with 
severe and nonsevere disease were similar (by t test, p = 0.5) in the present study (5.30 ± 
0.4 and 30.0 ± 0.5 years, respectively; mean ± SD), due to the exclusion of children <3 
years of age. 

30 

To further substantiate our findings and to investigate whether parasites from patients with 
severe P. falciparum malaria expressed particular VSA, we used the 96 x 68 recognition 
matrix (Fig. 1) to search for patterns of similarity in the VSA Ab recognition of the parasite 
isolates. Hierarchical cluster analysis identified three main clusters (data not shown). In 

35 one of these (cluster I), all but one (90%) of the isolates were from severe cases, whereas 
this was the case for only 5 of 20 (25%) in cluster II. The third and largest cluster (III) 
showed an intermediate pattern, with 21 of 38 (55%) isolates from patients with severe 
malaria (Fig. 3). The proportion of isolates from severe patients in the three clusters was 
thus quite different (by X 2 test, p = 0.009). When we analyzed the age distribution of the 

40 patients donating the parasites within the clusters, cluster I was composed of parasites 
from young patients, whereas the other two clusters contained parasites from older 
patients (Fig. 3A). The cluster-specific differences in patient age composition did not quite 
reach conventional statistical significance (by Kruskal-Wallis test, p = 0.06). The 
distribution of the parasite-specific sum of scores among the clusters showed that cluster I 
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was composed entirely of parasites expressing very well-recognized VSA, whereas the 
opposite was true for cluster II. Again, cluster III formed an intermediate group (Fig. 3B). 
The distribution of sum of scores was significantly different among the three clusters (by 
Kruskal-Wallis test, p < 0.001), with all pairwise differences being significant (by Dunn's 
5 post-hoc test, p < 0.01 in all cases). 

We have used flow cytometry to measure VSA-specific IgG to provide evidence of 
modulation of VSA expression by acquired immunity. Our method is particularly suited to 
this type of analysis, as it allows unbiased and quantitative analysis of large matrixes of 
10 VSA and corresponding Abs of specified isotype. We found that the level of plasma IgG 
recognition of VSA expressed by P. falciparum isolates obtained from patients with severe 
malaria was approximately twice that of VSA from nonsevere isolates (Fig. 4/1). In a similar 
way we found that VSA Ab recognition of isolates from young patients (3-4 years of age) 
was -2-foid that of isolates from older patients (5-11 years of age; Fig. 4B). 

15 

Thus, the VSA antibody recognition of parasites from severe patients is broader and more 
intense than recognition of VSA expressed by parasites from other malaria patients (Figs. 1 
and 2), independently of the age of the patient. Thus, our data suggest that acquisition of 
VSA-specific Ab responses gradually restricts the repertoire of VSA that are compatible with 
20 parasite survival in the semi-immune host. Furthermore, it appears to limit the risk of 
severe disease by preventing the expression of VSA likely to cause life-threatening 
complications, such as cerebral malaria and severe anaemia. 

Example 2: 

25 Sub-grouping of Plasmodium falciparum 3D? var genes based on sequence analysis of 
coding and non-coding regions 

PfEMPl is a polymorphic family of high molecular weight adhesion antigens expressed on 
the surface of infected erythrocytes. PfEMPl is an important target for protective immunity 
30 and is implicated in the pathology of malaria through its ability to adhere to host 
endothelial receptors. The accumulation of antibodies against a broad repertoire of 
PfEMPls is probably the functional basis for the natural acquisition of immunity to malaria 
(Bull et al. 1998) 

All var genes are characterised by a two-exon structure. Exon 1 encodes a large extra- 
35 erythrocytic and highly variable region containing two to seven Duffy-binding like (DBL) 
domains and mostly one or two cysteine-rich inter-domain region (CIDR) domains. Based 
on sequence homologies, the DBL domains can be sub-divided into a, p,y, 5, and e types 
and the CIDR domains into CIDRa other (CIDR-O) types. A subset of var genes 
furthermore contains a second cysteine-rich domain called C2. Exon 2 encodes the intra- 
40 erythrocytic (cytoplasmic) and conserved part of the protein. 



The entire genome of the P. falciparum clone 3D7 genome is now known, including its 
complete var gene repertoire. Figure 9 shows the domain structure of each of the 59 var 
genes as well as the truncated pseudo-gene PFE1 640\n. 
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Methods 

m2-l. Nucleotide and deduced amino acid sequences as well as location and transcriptional 
5 directions of 3D7 var and rif genes were obtained from the Plasmodium Genome Resource 
- http://www.plasmodb.org. 

m2-2. Alignments were performed using the ClustalW multiple alignment method, 
European Molecular Biology Laboratory, Heidelberg, Germany at default parameters (Gap 

10 Open: 10.00; Gap Extention: 0.20, Gap Separation Distance: 4, Protein weight matrix: 
Gonnet, DNA weight matrix: identity (IUB)). Alignments were corrected by hand using 
Bioedit (Hall, 1999) to assure homologous sequences for sequence analysis and tree- 
building. For distance tree-building, the var gene 5 1 flanking regions were defined as the 
600 bp, 1.4 kb or 2.0 kb upstream of the translation initiation codon and the 3' flanking 

15 region as the 500 bp downstream of the translation stop codon. The available var flanking 
sequences from other strains than 3D7 varied between 250 and 2100 bp in length. For the 
most part 3' sequences were those retrieved by Mercereau-Puijalon et al (Mercereau- 
Puijalon et al. 2002). Var gene domain structures were defined using definitions described 
in Smith et al. (Smith et al. 2000). DBLaCIDRl domains were aligned from Pro-Cys (PC) of 

20 DBLa homology block A to the conserved GJu-Trp (EW) motif of CIDR M2 area, resulting in 
sequence lengths of 550 to 650 aa. For analysis of DBL relationships sequences covering 
Pro-X-Arg-Arg (PXRR) of DBL homology block B to Glu-Trp (EW) of homology block H were 
aligned. 

m2-3. Distance trees were constructed the by p-distance/neighbour-joining (NJ) method 
25 as well as maximum parsimony (MP) using MEGA version 2.1 (Kumar et al. 2001). Trees 
were bootstrapped 1000 times and compared between NJ and MP tree-building methods to 
assure confidence in topology. Observed clusters from each tree were confirmed visually 
on alignments. 

30 The 1.5 kb 5 ! region of 3D7 var genes has previously been described to group into three 
major sequence groups, upsA, upsB and upsC (Gardner et al. 2002) . To further 
investigate sequence similarities in this region, we analysed the 2 kb upstream sequences 
of all 3D7 var genes and the pseudo var gene PFE1640W. In agreement with Gardner et aL 
(Gardner et al. 2002) the alignments revealed three major sequence groups with high 

35 similarity between sequences of each group. However, two sequences did not align well 
with any of the groups and within the groups, subgroups could be identified. Thus, each of 
the groups were analysed separately. 

Ten var genes had 5' regions belonging to the upsA group and all but one were positioned 
40 head-to-head with a r/fgene, the exception PF08_0141 was head-to-head with another var 
gene (Fig. 10). Using a primer set targeting upsA 3D7 sequences around -900 bp from the 
translation initiation codon, we could PCR amplify products of the expected sizes in 3D7 
genomic DNA, as well as in five of five field-isolates tested (data not shown). These data 
suggests that upsA regions not are unique to 3D7. 
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Alignments and tree-building (Fig. 11) of the 500 bp var 3' regions divided most sequences 
into four clusters (A-D). 13 sequences fell outside these clusters, and the relationship 
between these sequences could not be confirmed by bootstrapping or comparison the two 
5 tree-building methods used. 

Because most PfEMPl molecules contain a semi-conserved head structure comprising of 
DBLla and CIDR1, we restricted the analysis of coding sequences to these domains. In 
3D7 all but one var gene encode a DBLla as the first domain and in all but four genes DBL1 

10 is followed by a CIDR1. Since alignment and tree constructions of DBL1 and CIDR1 
domains individually yielded almost identical clusters, we decided to analyse the head 
structure sequences from the N-terminal region of DBL1 to a conserved motif in the C- 
terminal region of CIDR1 (Fig. 12). Fifty-two sequences, including that of pseudogene 
PFE1640w, could be grouped into five clusters, and four sequences could not be assigned 

15 any of these. When all CIDR sequences are aligned most CIDRls fall into separate clusters 
of CIDRa or CIDRal domains (Robinson et al. 2003). The exceptions are three sequences 
(PF08_0141, PF11_0008, PF13_0003), which fall into a CIDRy cluster. In Figure 12, the 
head structures of these genes fall into group A. Robinson et al. (Robinson et al. 2003) 
found that most CIDR domains bind CD36 but identified nine, which did not. These 

20 constitute cluster A. 

Figure 9 schematically sums up the findings of all the var gene sequence analyses. The 
combination of clusters and chromosomal organization of the var genes indicate that var 
genes can be grouped into three major subgroups, var group A, B and C and two 

25 intermediate groups group B/A and group B/C, which appear to represent transitions 

between these three groups. The two genes previously shown to belong to conserved var 
families, varl and var2, fell outside these groups. Group A var genes were most easily 
defined, whereas the borders of the proposed group B and C were less clear (Fig. 9A). The 
grouping was supported by analyses of both coding and non-coding sequences. However, 

30 the best predictors for the groups were the upstream region and chromosomal 

organization. Thus, genes placed near the telomere and with a transcriptional direction 
towards the telomere all had upsA sequences and formed group A. Group B were 
dominated by telomeric located but centromeric transcribed genes flanked by upsB and 
finally group C harboured all centromeric located genes with a upsC 5'region. 

35 

Group A comprise most large PfEMPls with a domain structure different from the most 
common 4-domain type, which is the predominant domain structure of Group B and C. 
Two genes PF08_0140 and MAL6P1.316 were classified as B/A because they had upsBsh 5' 
regions and chromosomal characteristics in common with group B genes, but had DBLa- 
40 CIDR1 sequences and domain structure characteristic for group A genes. Interestingly, 
these two genes are adjacent to a group A var or a pseudo var gene with an upsA region 
both transcribed in the opposite direction, thereby merging their 5'regions. Adding the 
DBLla-CIDRl of the flanking pseudogene MAL6P1.317 to the alignments placed this 
pseudogene within DBLa-CIDRl cluster A. 
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The fact that 5 T regions predict var gene chromosomal organisation and domain structure, 
and sequence similarities in coding and non-coding regions several thousand bases 
downstream from the translation initiation site implies that recombination, or other 
5 mechanisms of homogenizing exchange is much more likely to occur between var genes 
within a group than between var genes of different groupings. It can be proposed that an 
original ancestral var gene has been duplicated and diverged in the three main types, and 
each of these have then diverged into the genes of each group. In this process information 
may also have been exchanged between genes of different groupings. The data suggests 
10 that some exchange have taken place between groups B and C and some characteristics of 
group A have leaked into these groups, but that characteristics from groups B and C have 
not gained access to group A. 

In conclusion, var genes can be sub-grouped into three major groups (group A, B and C) 
15 and two intermediate groups B/A and B/C representing transitions between the three 

major groups. The best defined var group, group A, comprises telomeric genes transcribed 
towards the telomere encoding PfEMPls with complex domain structures different from the 
4-domain type dominant of groups B and C. A rif subgroup transcribed towards the 
centromere was found neighbouring var genes of group A such that the rif and var 5' 
20 regions merged. This organization appeared to be unique for the group A var genes. 

Example 3: 

Selection of P. falciparum isolate 3D7 for expression of well recognised VSA in vitro 

25 Establishment of the genetic control of changes in VSA expression in response to in vitro 
selection is now possible because of the availability of the entire genomic sequence of the 
P. falciparum clone 3D7, which is a long-term clone derived from P. falciparum NF54 
isolated from a Dutch malaria patient (Delemarre and Van der Kaay, 1979). 

30 As a first step towards direct molecular identification of VSA S M-encoding genes in 3D7, we 
established a method to enforce expression of VSA SM -like antigens in this parasite clone by 
a novel selection method using plasma from semi-immune children with low levels of 
VSA UM -IgG but high levels of VSA SM -IgG (Fig. 5). 

35 Materials and methods 

m3-l. To enrich 3D7 for iRBC expressing VSA well recognized by IgG in the SMI plasma 

pool, 1 *10 8 RBC infected with late trophozoite-stage parasites purified by gelatine 
flotation) were mixed with 200 pL pooled plasma in 3 ml_ culture medium, and 
40 incubated the mixture at room temperature (20-25°c) for 15 min. Antibodies not 

reactive with intact iRBC were removed by washing * 2 (800 g, 30 min). The iRBC 
were then resuspended in 3 mL culture medium and incubated as above with 
100 biotinylated secondary antibody recognizing human IgG (Dako, Giostrup, 
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Denmark). The iRBC were washed twice and resuspended as above and 
subsequently mixed with 100 |JL streptavid in-coated DynaBeads (Dynal, Oslo, 
Norway). Finally, we isolated DynaBeads-coated iRBC by placing the IRBC 
suspension in a magnet field until all DynaBeads had settled at the magnet. The 
5 culture medium containing RBC and iRBC not covered with DynaBeads was 

removed by decantation. This procedure was repeated once before the DynaBeads- 
covered iRBC were transferred to a new culture bottle with medium and uninfected 
RBC for continuation of in vitro culturing. 
m3-2. Human plasma samples used for selection: We used repeated rounds of panning 
10 on DynaBeads coated by IgG from two plasma pools (SMI, SM2) from semi- 

immune Ghanaian children and one plasma pool (SM3) from semi-immune 
Tanzanian children to select 3D7 parasites expressing VSA that were highly 
recognized by IgG in these plasma pools (Staalsoe et al., 2003). 

15 Flow cytometry analysis of unselected 3D7 indicated that its VSA expression was 
heterogeneous but that the majority of the iRBC expressed VSA that were poorly 
recognised by IgG in the plasma of individuals living in areas of endemic P. falciparum 
transmission (Fig. 5). In particular, only few iRBC were specifically labelled by IgG in the 
plasma of semi-immune children with documented high levels of IgG with specificity for 

20 VSA expressed by P. falciparum isolated from patients with severe malaria (VSA SM ) and low 
levels of VSA expressed by patients with uncomplicated malaria (VSA UM ) (Fig. 5A). 
Although IgG levels in the plasma of sympatric adults were higher than in children, they 
were still moderate (Fig. 5B). Taken together, these data indicate that the VSA expressed 
by unselected 3D7 were of the VSA UM type. 

25 

Only a minority of RBC infected by unselected 3D7 expressed VSA SM -type antigens. To 
increase the proportion of iRBC expressing VSA SM -type molecules in the culture, we first 
incubated 3D7 with the SMI pool of plasma from semi-immune children, then with 
biotinylated anti-human IgG and finally with streptavidin-coated DynaBeads. As expected 

30 from the heterogeneous IgG recognition of VSA expressed by unselected 3D7, this resulted 
in some iRBC being covered by multiple DynaBeads while most were devoid of any beads 
(Fig. 6). We subsequently isolated the DynaBeads-covered iRBC by exposing the culture to 
a strong magnetic field, and used only these to propagate the culture further. Flow 
cytometry analysis showed that the IgG recognition of the VSA expressed by the 3D7 SM1 

35 sub-clone obtained by three rounds of selection in this manner was much stronger than 
that of VSA expressed by the unselected 3D7 parental clone (Fig. 5C-D). The biggest 
increase in IgG recognition of VSA following selection was seen in the plasma from children 
with documented high levels of VSA SM -type IgG and low IgG levels with specificity for 
VSA UM -type antigens (Fig. 5A and 5C). This, and the fact that the level of IgG recognition 

40 of 3D7 SM1 iRBC was both high and similar in children (Fig. 5C) and adults (Fig. 5D) show 
that the applied selection method had caused a change from predominantly VSA UM -type 
antigens being expressed in unselected 3D7 to VSA S M-type expression in 3D7 S mi- 
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The 3D7 S mi VSA S M-expressing sub-clone was selected by its reactivity to IgG in a plasma 
pool from West African children living in an area of hyperendemic, seasonal P. falciparum 
transmission. To investigate whether the high VSA-specific IgG reactivity against 3D7 SM i 
extended beyond this region and in a variety of epidemiological settings, we measured 
5 levels of IgG with specificity for the VSA expressed by 3D7 sm in series of plasma samples 
obtained from individuals (8 to 42 years of age) living in an area of hypoendemic, highly 
seasonal transmission in Sudan (Fig. 7A) and from children aged 3-11 years living in area 
of very intense transmission in Tanzania (Fig. 7B). All 20 Tanzanian children had medium- 
to-high levels of IgG unselected 3D7 iRBC and to all of five local parasite isolates, 
10 reflecting the high endemicity in their area of residence (Fig. 7B). Nevertheless, levels of 
IgG with specificity for the 3D7 S mi VSA were higher than those with specificity for 
unselected 3D7 (Fig. 7B). Importantly, levels of 3D7 SM1 VSA-specific IgG were higher than 
IgG with specificity for VSA expressed by any of the local isolates, although 3D7 

sMi was 

selected on IgG from West Africa. Taken together, these data suggest the existence of 
15 substantial conservation in time and space of the commonly and highly recognised VSA SM - 
type iRBC surface antigens. 

In conclusion, these findings open the possibility of identifying VSA of importance in the 
pathogenesis of severe disease (VSA SM ) by comparing VSA gene expression in isogenic 
20 parasites expressing VSA VM - and VSA SM -type iRBC surface molecules that are characterised 
by their differential IgG recognition pattern rather than by their adhesion specificity. 



Example 4: 

3D7 parasite expressing VSAsndoes not adhere to CD36 

25 

Most members of the PfEMPl family of 3D7 posses the capacity to bind C036 and the 
binding motives have been mapped to the N-terminai part of the molecules. PfEMPl that 
bind CD36 are structurally and functionally related (Robinson et al 2003) and belong to 
Groups B, C, or B/C (Lavstsen et al. 2003) and Example 2. In this example we show that 
30 3D7 expressing a VSA UM phenotype bind CD36, whereas 3D7 expressing the VSA SM 
phenotype does not. 



Materials and methods: 

35 m4~l. We used Chinese hamster ovary (CHO) cells transfected to express human CD36 
and cultured by standard methodology to measure iRBC adhesion to these 
receptors (Staalsoe et al. 1999; Hasler et al. 1993; Rogerson et al- 1995). In brief, 
parasites were radiolabeled by incubating the cultures overnight in the presence of 

H-phenylalanin (1 MBq for a standard culture containing 200 pi packed RBC). 
40 Wildtype and CD36 transfected CHO cells were grown to a monolayer in 96-well 

microtitre plates (Nunc, Roskilde, Denmark). Late-stage-enriched iRBC (100 pi, 
IxlO 7 RBC/ml) were added to the CHO cell monolayer and incubated for one hour 
at 37°C before unbound iRBC were washed away from the CHO cell monolayer. 
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Finally, the number of CHO-adhering iRBC was determined by liquid scintillation 
spectrometry. 

To investigate the relationship between VSA SM expression and iRBC adhesion to CD36, we 
5 compared the ability of RBC infected by 3D7 and 3D7 SM i to adhere to CD36-transfected 
(CD36-CHO) CHO cells (Fig. 8). While 3D7 showed four- to 10-fold higher adhesion to 
CHOCD36 cells than to wild-type CHO-0 cells, adhesion of 3D7 SM1 to the transfected cells 
was never more than twice that to wild-type cells (Fig. 8). The average adhesion ratio was 
30-3 for 3D7 and 1-4 for 3D7 S mi, corresponding to a difference of 4-9 [95% confidence 
10 interval: (1-9-7-9)]. These results indicate that there is a relationship between VSA SM 
expression and the loss of ability of iRBC to adhere to CD36. 

In conclusion, these results indicate that parasites expressing the VSA SM phenotype do not 
adhere to CD36. 

15 

Example 5: 

3D7 parasite expressing VSA SM adheres to PECAM (CD31) and VCAM-1 

To investigate the relationship between VSA SM expression and iRBC adhesion to PECAM 
20 (CD31), VCAM-1, ICAM-1, and CD36, we assessed the ability of RBC infected by 3D7 SM1 to 
adhere to TrHBMECs expressing all of these receptors except CD36. 



Materials and methods: 

25 m5-l. We used transformed human bone marrow endothelial cells (TrHBMECs) cultured by 
standard methodology to measure iRBC adhesion to receptors on endothelial cells 
receptors (Jensen et al. 2004 and references herein). In brief, parasites were radio- 
labelled by incubating the cultures overnight in the presence of 3H-hypoxanthin (1 MBq for 
a standard culture containing 200 pL packed RBC) in RPMI 1640 supplemented with 10% 

30 NHS (having low concentration of unlabelled hypoxanthin). Late-stage-enriched iRBC (100 
pi, 1x107 RBC/ml) were added to the TrHBMECs and incubated for one hour at 37°C 
before unbound iRBC were washed away. Antibodies specific for PECAM (CD31), VCAM-1, 
ICAM-1 and CD36 were added to the TrHBMECs to assess the %-binding of iRBC following 
blocking of these receptors on the surface of TrHBMECs. The number of TrHBMEC-adhering 

35 iRBC was determined by liquid scintillation Spectrometry and calculated as percentage 
binding. 

Binding of 3D7 SM i was inhibited by antibodies to PECAM (CD31) and VCAM-1, but not by 
any of the other antibodies. Thus, indicating that 3D7 S mi expressing VAR4 on its surface 
40 binds to PECAM (CD31) and VCAM-1 (data no shown). 
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Example 6: 



Selection of P. falciparum isolate 3D7 for expression of well recognised VSA In vitro results 
in selective up-regulation of Group A var genes 

Parasite-encoded PfEMPl proteins expressed on the surface membrane of iRBC mediate 
the adhesion of such erythrocytes to a range of host receptors. The PfEMPl proteins are 
encoded by the var gene family containing 50-60 members per haploid parasite genome. 
Different PfEMPl molecules have different receptor specificities, and clonal switching 
between expression of the various var gene products in a mutually exclusive 
manner allows the parasite to modify its adhesion properties. Gene expression and 
switching can be examined using gene-specific primers and real-time PCR. 
Using real-time quantitative PCR we compared var gene expression in the parasite line 
3D7 before (3D7 UM ) and after (3D7 SM ) selection for antibody recognition (Example 3), 

m6-l. RNA was purified from 3D7 UM and 3D7 SM and used for the synthesis of cDIMA. "Total 
RNA was prepared with Trizol LS ( http : //www. in vitrogen .com ) as recommended by 
the manufacturer, and treated with DNAsel ( http://www.invitrogen.com ). Absence 
of DNA in RNA samples was confirmed by stable base fluorescence after 40 cycles 
of real-time PCR with seryhtRNA synthetase primers (Saianti et al., 2003)) (Table 
1). One jag of DNA-free RNA was reversed transcribed using Superscript II and 
random hexamer primers (( http://www.invitrogen.com ^ at 25°C for 10 min, 42°C 
for 50 min followed by 70°C for 15 min in a total volume of 40 

Table 1 

Primer sets used in real-time PCR assays to specifically amplify 59 var genes and one 
pseudogenes (underlined). Where several genes are listed next to a single primer set, 
primer targets in the listed genes were identical. 

Primer Forward primer Reverse primer Target gene(s) 



set 



1 

4 
5 
6 



TGCGCTGATAACTCACAACA AGG GGTTC ATCGTC ATCTTC 
GACGAGGAGTCGGAAAAGAC TGGACAGGCTTGTTTGAGAG 
GTGCACCAAAAGAAGCTCAA ACAAAACTCCTCTGCCCATT 
GAGGCTTATGGGAAACCAGA AGGCAGTCTTTGGCATCTTT 
GACGGCTACCACAGAGACAA CGTCATCATCGTCTTCGTTT 



PFA0005W 
PF10_0001 
PF10_0406 
PF11_0007 



7 



PF11_0008 (SEQ ID NO.: 
3) 



8 

9 

10 

11 

12 

15 

17 

18 

19 

20 



TG CTGAAG ACCAAATTGAGC 

TCGATTATGTGCCGCAGTAT 

TGGTGATGGTACTGCTGGAT 

GACGCCTGCACTCTCAAATA 

AGCAAAATCCGAAGCAGAAT 

C ATCC ATTACG C AG G ATACG 

GGCACGAAGTTTTGCAGATA 

CG G AGGAGGA A AA ACAAG AG 

CGGAATTAGTTGCCTTCACA 

CACAGGTATGGGAAGCAATG 



TTGTTGTGGTGGTTGTTGTG 

TTCCCGTACAATCGTATCCA 

TTTATTTTCGGCAGCATTTG 

TTGGAGAGCACCACCATTTA 

CCCACAGATCTTTTCCTCGT 

AAATAGGGTGGGCGTAACAC 

TTTGTGCGTCTTTCTTCGTC 

TGCCGTATTTGAGACCACAT 

CATTGGCCACCAAGTGTATC 

CCATACAGCCGTGACTGTTC 



PF11_0521 

PFL0020W 

PFL0030C 

PFL0935C 

PFL1950W 

PFL1960W 

PFL2665C 

PFL0005W 

PF13_0364 



PF13_0003 (SEQ ID NO.: 
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Primer Forward primer Reverse primer Target gene(s) 

set 



5) 



21 


CAM 1 1 1 GGGTGTGGAATCA 


C A CTG G C C ACC A AGTGTATC 


PFB1055C 


22 


ATGTG CG CTACAAG A AGCTG 


TTGATCTCCCCATTCAGTCA 


PFB0010W 


23 


CAATCTGCGG C A ATAG AG AC 


CCACTGTTGAGGGGTTTTCT 


PFC1120C/PFC0005W 


25 


ATATGGG A AG G G ATG CTCTG 


TGAACCATCGAAGGAATTGA 


PFD0020C 


26 


ACCGCCCCATCTAGTGATAG 


CACTTGGTGATGTGGTGTCA 


PFD0615C 


27 


TAAAAGACGCCAACAGATGC 


TCATCGTCTTCGTCTTCGTC 


PFD0625C 


28 


ACTTTCTGGTG G GG AATC AG 


TTCACCGCCACTTACTTCAG 


PFD0630C/PFD0635C 


30 


GACGACGATGAAGACGAAGA 


AG ATCTCCG C ATTTCCA ATC 


PFD0005W 


32 


ACCTAATGGCGAAAACCAAG 


ACACTTGCCTTCATCCACTG 


PFDIOOOc 


34 


TGCAACGAAACATTAGCACA 


AGCAGGGGATGATGL 1 1 IAC 


PFD1015C 


35 


AAACACGTTGAATGGCGATA 


GACGCCGAGGAGGTAAATAG 


PFD1235W/MAL7P1.1 








(SEQID NO.: 1) 


36 


TGACGACTCCTCAGACGAAG 


CTCC ACTG ACG G ATCTGTTG 


PFD1245C 


37 


AAGAAAGTGCCACAACATGC 


GTTCGTACGCCTGTCG 1 1 IA 


PFE1640W 


38 


G A AGCTG GTG GTACTG ACG A 


TA 1 1 11 CCCACCAGGAGGAG 


PFE0005W 


39 


ATTTGTCGCACATGAAGGAA 


AACTTCGTGCCAATGCTGTA 


MAL6P1.252 


41 


G GTGTCA AGG CAG CTAATG A 


TATGTCCTGCG CTA 1 1 1 1 GC 


PF08_0141 


43 


GTCGTGGAAAAACGAAAGGT 


TATCTATCCAGGGCCCAAAG 


PF08_0142 


44 


ATGTGTGCGAGAAGGTGAAG 


TGCCTTCTAGGTGGCATACA 


MAL6P1.4 


45 


CAA 1 1 i 1 ( CCG ACGCTTGTA 


CACATATAGCGCCGTCCTTA 


PF07_0048 


46 


GCGACGCTCAAAAACATTTA 


TCATCCAACGCAATCTTTGT 


PF07„0050 


49 


GTTG AGTCTG CG GCA ATAG A 


CTGGGGTTTGTTCAACACTG 


PF07__0049 


50 


CACACATGTCCACCACAAGA 


ACCCTTCTGTGGTGTCTTCC 


MAL7P1.56 


51 


ACGTGGTGGAGACGTAAACA 


CCTTTGTTGTTGCCACTTTG 


MAL7P1.55 


52 


CGTGGTAGTGAAGCACCATC 


CCCACC 1 1 Ci 1 GTGGTTTCT 


PFO7_0051 


53 


TGACGACGATAAATGGGAAA 


1 1 CI 1 1 1 GGAGCAGGGAGTT 


PF07JD139 


54 


ACCAAGTGGTGACAAAGCAG 


G G GTG GC AC AC A A AC ACTAC 


PFD1005c/PFD1015c 


55 


TTTGTCCGGAAGACGATACA 


ATCTG GG GCAGA ATTACCAC 


PF08„0106 


58 


CACACGTGGACCTCAAGAAC 


AAAACCGATGCCAATACTCC 


PFI1830C 


66 


CCTAAAAAGGACGCAGAAGG 


CCAGCAACACTACCACCAGT 


PF08_0107 


67 


AAGGGAAGATTGGTGGACAG 


AGGGGGATCAGTATCACGTC 


MAL6P1.314 


91 


ACAAAGGAACGTCCATCTCC 


GCCAATACTCCACATGATCG 


PF13_0001 


92 


TG CAAG GGTG CTA ATGGTA A 


CCTGCAI 1 1 1 GACATTCGTC 


PF08_0103 


93 


G AC AA ATACGG CG ACTACG A 


TGTTTCACCCCATTCTTCAA 


MAL6P1.1 


94 


TGGAAAGAACATGGACCTGA 


TTCCTCGAGGGAAGAATCAC 


MAL6P1.316 


95 


TCACAACCTGACCCCCTACT 


TCTTCGTCGTTGTCATCCTC 


PFD0995C 


96 


TGACCAAGACGAAGTATGGAA TTG ATCTCTGTTCG CTGTCC 


PFI1820W 


97 


TCATTATGGGAAGCACGATT 


TG ATTTCTACC ATCGCAAG G 


PFA0015C 


98 


ATGGTGGCAAACTTGTGAGA 


TCCAATTGGTCTCCTTGACA 


PF08_0140 


99 


AGGGAGCATCAGGTGGTAGT 


GCTGTGCATGCJ 1 1 1 ICATT 


MAL7P1.50 


100 


GATCAAAAGAGGCGGAGAAG 


TTCCAATTGGGGAA i 1 1 1 IC 


PFI0005W 


101 


CAAGAGACACAACCGGAAGA 


CACTTCC A ATTG G G G AATTT 


PFA0765C 


60* 


AAGTAGCAGGTCATCGTGGTT TTCG G C ACATTCTTCC ATA A 


PF07JD073 



* Endogenous control gene: seryl-tRNA synthetase (PF07_0073) 
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To study gene expression of individual var genes a specific primer set for each of 59 var 
genes and one pseudogenes in the 3D7 genome was designed (Table 1), and real-time 
PCR was performed on cDNA from 3D7 UM and 3D7 SM . 

5 m6-2. Real-time PCR was done using a Rotorgene thermal cycler system 

( http : //www. corbett resea rch .com ). Change in PFD1235w/MAL7PlA (SEQ ID NO.: 
1), PF11_0008 (SEQ ID NO.: 3) , and PF13_0003 (SEQ ID NO.: 5) gene 
transcription after selection for antibody recognition. Specific primers targeting SEQ 
ID NO.: 1, SEQ ID NO.: 3, and SEQ ID NO. : 5 (Figs. 13 and 14) were used to 

10 measure transcription before (3D7 UM ) and after antibody selection (3D7 sM ). The fold 

change in transcription levels, normalised against seryhtRNA synthetase, was 
calculated for each primer set by the AACt method (User Bulletin #2, Applied 
Biosystems, http://www.appliedbiosystems.com ). Reactions were performed in 
20 \x\ volumes using QuantiTect SYBR Green PCR master mix and 0.5 mM primers, 

15 according to manufacturer's instructions ( http://www.qiaQen.com ). PCR cycling 

conditions optimised for P. falciparum cDNA were 95°C for 15 min followed by 40 
cycles of 94°C for 30 sec, 54°C for 40 sec, and 68°C for 50 sec with a final 
extension at 68°C for 10 min. Data acquisition was done at the end of elongation of 
each cycle. Specificity of amplification was ascertained by melting-curve analysis of 

20 each PCR product. Electrophoresis of PCR products and EB staining was performed 

and revealed no bands from no-template controls and single bands for all targets in 
cDNA PCR products. Quantification was done using the Rotorgene software version 
4.30 f http: //www. qiaaen.com ). Transcription levels of the endogenous 
P. falciparum genes actin, seryhtRNA synthetase and aldolase were analysed in 

25 order to determine the most accurate endogenous control. P. falciparum seryhtRNA 

synthetase displayed the most uniform transcription profile in different parasite 
isolates and an unchanged pattern throughout the parasite life and was thus used 
for calculations of fold changes in var gene transcription by the ACT method 
(described in User Bulletin #2, Applied Biosystems, 

30 http://www.appliedbiosvstems.com ). 

m6.3. Genes (PF11_0521, PFL0030C, PFL1950w, PFL005W, PF13_003 (SEQ ID NO.: 5), 
PFB1055C, PFD1235W/MAL7P1.1 (SEQ ID NO.: 1), PFE1640W, PFI1830, 
PF08_0107, MAL6P1.314, MAL6P1.316, PFD0995c, PFA0015C, seryhtRNA 

35 synthetase) used for absolute quantification were PCR amplified, ligated into the 

pCR2.1 TOPO vector and transformed into Eschericia coli TOPO10 cells (TA cloning 
System, Invitrogen). Plasmids were purified using Qiagen Miniprep spin columns 
(Qiagen, Merck Eurolab, Albertslund, Denmark) and the identity of inserts verified 
by subsequent sequencing on an ABI Prism 310 (Perkin-Elmer) using the Big Dye 

40 terminator reaction mix, ABI Prism proofreading and translation software, and the 

P. falciparum 3D7 genome database ( http://www.plasmodb.org ). 
Plasmid concentrations were determined by spectrophotometry and serial 10-fold 
Tris-EDTA buffer (pH=7.5) dilutions ranging from 5xl0 8 to 5x10° template copies 
per real-time PCR reaction. Triplicate real-time measurements were made for each 
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dilution and a best-fit standard curve was generated using the RotorGene 2000 
software. The standard curves were linear across a range of seven logs of DNA 
concentrations with correlation coefficients between 0.9779 and 0.9969. The 
detection limit of the system was > 20 copies (data not shown). The coefficient of 
5 variance (CV) was calculated as 100*(standard deviation/mean). RT-PCR was 

performed as described using 1 pg total RNA in a total volume of 40 pi of which 0.5 
pi was subsequently used for real-time PCR. Absolute values were calculated from 
the standard curves. 



10 Real-time PCR followed by calculating fold change in 3D7 UM compared to 3D7 sM 

demonstrated marked upregulation of three Group A var genes PFD1235/MAL7P1.1W (SEQ 
ID NO.: 1), PF1 1^0008 (SEQ ID NO.: 3), and PF13_0003 (SEQ ID NO.: 5). The 
transcription of Group B, C or B/C var genes was either downregulated or not regulated in 
the 3D7 SM compared to 3D7um (Figs. 13 and 14, Table 2). The up-regulated var genes 

15 PFD1 235W/MAL7P1 . 1 (SEQ ID NO.: 1), PF11_0008 (SEQ ID NO.: 3), and PF13_0003 (SEQ 
ID NO.: 5) were also the most dominant var transcript when doing absolute quantification 
(Table 2) with var gene-specific primers. The upregulated and dominant 
PFD1 235W/MAL7P1 . 1 (SEQ ID NO.: 1), PF 11^0008 (SEQ ID NO.: 3), and PF13_0003 (SEQ 
ID NO.: 5) gene showed 3 to 10-fold higher level of expression following antibody selection 

20 both in ring-stage and trophozoite/schizont-stage 3D7 parasites (Table 2). 
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In conclusion, IgG selection resulted in marked upregulation of SEQ ID NO.: 1, 3, and 5. 
Example 7: 

5 Selection of P. falciparum isolate 3D7 for adhesion to human endothelial cells in vitro 
results in selective up-regulation of Group A var genes 

The particular virulence of P. falciparum is due to the ability of infected erythrocytes to 
adhere to a variety of host receptors on the endothelial lining such as ICAM-1, VCAM, 
10 thrombospondin, ELAM-1, and CD36, and avoid splenic clearance. To show that selection 
of 3D7 on human endothelial cells leads to a serological phenotype and a transcription 
profile similar to 3D7 SM i we did several rounds of selection on bone marrow derived 
endothelial cells generating 3D7 en do. 

15 m7-l. We used human endothelial cells to select 3D7 for increased VSA SM expression. 

Human endothelial cells were cultured by standard methodology to measure iRBC 
adhesion to receptors on the endothelial cells. In brief, parasites were radiolabeled 

by incubating the cultures overnight in the presence of H-phenylalanin (1 MBq for 
a standard culture containing 200 pi packed RBC). Late-stage-enriched iRBC (100 

20 pi, lxlO 7 RBC/ml) were added to the endothelial cell monolayer and incubated for 

one hour at 37°C before unbound iRBC were washed away from the endothelial cell 
monolayer. Finally, the number of endothelial celi-adhering iRBC was determined 
by liquid scintillation spectrometry. 
m 7-2. Real-time PCR was done using a Rotorgene thermal cycler system 

25 ( http : //www . corbettresea rch . com ) . Change in PFD1 235w/MAL7Pl . 1 (SEQ ID NO.: 

1), PF11_0008 (SEQ ID NO.: 3) , and PF13_0003 (SEQ ID NO.: 5) gene 
transcription after selection for antibody recognition. Specific primers targeting SEQ 
ID NO.: 1, 3, and 5 were used to measure transcription before (3D7 UM ) and after 
selection for adhesion to human endothelial cells (3D7 endo ). The fold change in 

30 transcription levels, normalised against seryl-tRNA synthetase, was calculated for 

each primer set by the AACt method (User Bulletin #2, Applied Biosystems, 
http://www.appliedbiosystems.com ). Reactions were performed in 20 \x\ volumes 
using QuantiTect SYBR Green PCR master mix and 0.5 mM primers, according to 
manufacturer's instructions ( http://www.qiaQen.com ). PCR cycling conditions 

35 optimised for P. falciparum cDNA were 95°C for 15 min followed by 40 cycles of 

94°C for 30 sec, 54°C for 40 sec, and 68°C for 50 sec with a final extension at 
68°C for 10 min. Data acquisition was done at the end of elongation of each cycle. 
Specificity of amplification was ascertained by melting-curve analysis of each PCR 
product. Electrophoresis of PCR products and EB staining was performed and 

40 revealed no bands from no-template controls and single bands for ail targets in 

cDNA PCR products. Quantification was done using the Rotorgene software version 
4.30 ( http://www.aiaqen.com ). P. falciparum seryl-tRNA synthetase was used for 
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calculations of fold changes in var gene transcription by the ACT method (described 
in User Bulletin #2, Applied Biosystems, http://www.appliedbiosvstems.com ). 

The resulting cell line 3D7 en do showed a FACS profile similar to that of 3D7 SM i (Example 3 
5 and Fig. 5) and a marked up-regulation of the PFD1235w/MAL7Pl.l (SEQ ID NO. : 1), 
PF11_0008 (SEQ ID NO.: 3) 7 and PF13_0003 (SEQ ID NO. : 5) var genes as analysed by 
real-time quantitative PCR (Fig. 20). 

In conclusion, we find that selection of 3D7 for adhesion to human endothelial cells confer 
10 a serological phenotype, VSA SM similar to that of parasites causing severe malaria and to 
3D7 selected with childrens plasma. As in 3D7 SM1 the acquisition of the VSA SM phenotype in 
3D7 endo is accompanied by marked upregulation of the PFD1235w/MAL7Pl.l (SEQ ID NO.: 
1), PF11_0008 (SEQ ID NO.: 3), and PF13_0003 (SEQ ID NO.: 5) var genes. 

15 Example 8: 

Northern blots of antibody-selected, ring-stage 3D7 showed strong hybridization signals 
corresponding to full-length transcripts of PFD1235w/MAL7Pl.l (SEQ ID NO.: 1). 

Materials and methods: 

20 

m8-l. For Northern blotting, we used 10 ng of total RNA separated in a standard 

denaturing MOPS-formaldehyde agarose gei and transferred to positively charged 
nylon membranes overnight (Sambrook et al., 1989). RNA was cross-linked to the 
membrane by baking for 30 min at 120°C. DIG-labeled RNA probes were generated 
25 using the DIG RNA labeling kit (Roche, Hvidovre, Denmark). Hybridization, 

washing, and detection were done according to the manufacturer's 
recommendations with a hybridization temperature of 65°C in DIG Easy Hyb buffer 
(Roche). 

30 Northern blots of antibody-selected, ring-stage 3D7 showed strong hybridization signals 
corresponding to full-length transcripts of the upregulated var gene PFD1235w/MAL7Pl . 1 
(SEQ ID NO.: 1) (Fig.l5A). The corresponding signals were either absent or much weaker 
in Northern blots of unselected 3D7. Similarly, PF11_0008 (SEQ ID NO.: 3) and PF13__0003 
(SEQ ID NO.: 5) were found to be present as full-length transcripts in antibody-selected 

35 3D7, but not in 3D7 (data not shown). 

Taken together, these results shows that antibody-selected 3D7 expressing VSA S M~type 
antigens transcribe full-length PFD1235w/MAL7Pl.I (SEQ ID NO.: 1), PF11_0008 (SEQ ID 
NO.: 3) and PF13_0003 (SEQ ID NO.: 5). 
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Example 9: 

The PFD1235W/MAL7P1.1 (SEQ ID NO.: 1), PF11_0008 (SEQ ID NO.: 3), and PF13_0003 
(SEQ ID NO.: 5) genes that is selectively upregulated in P. falciparum isolate 3D7 following 
selection for antibody recognition in vitro belong to the Group A var genes and share their 
5 characteristics 

From Example 3, 4, 5, 6 and 7 it appears that the Group A PFD1235w/MAL7Pl.l (SEQ ID 
NO. : 1), PF11_0008 (SEQ ID NO.: 3) and PF13_0003 (SEQ ID NO.: 5) genes encodes 
VSAsM-Iike proteins, and that this protein is involved in the acquisition of the VSA SM 

10 phenotype following antibody-selection of VSA UM -expressing 3D7 (3D7 SM ). The fact that 
almost identical results were obtained using plasma pools from children living in West 
Africa (pools SMI and SM2) and East Africa (plasma poo! SM3) indicate that a PfEMPl 
similar to that encoded by PFD1235w/MAL7Pl.l (SEQ ID NO.: 1), PF11_0008 (SEQ ID 
NO.: 3) and PF13_0003 (SEQ ID NO.: 5) are present in P. falciparum parasites transmitted 

15 on both sides of this vast continent. 

In conclusion, PFD1235w/MAL7Pl.l (SEQ ID NO.: 1), PF11_0008 (SEQ ID NO.: 3) and 
PF13_0003 (SEQ ID NO.: 5) seems to belong to a sub-group of var genes, with a 
constrained ability to recombine, that are functionally conserved and commonly recognised 
20 by VSA 5M ~antibodies. 



Example 10: 

PFD1235W/MAL7P1.1 (SEQ ID NO.: 1), PF11J3008 (SEQ ID NO.: 3) and PF13_0003 (SEQ 
ID NO. : 5) belongs to a var sub-family that is common and highly conserved in 
25 P. falciparum isolates 

PFD1 235W/MAL7P1 . 1 (SEQ ID NO.: 1), PF1 1_0008 (SEQ ID NO.: 3) and PF13_0003 (SEQ 
ID NO.: 5) are the dominant transcript and is highly upregulated in the P. falciparum 
isolate 3D7 following selection for antibody selection and adhesion to human endothelial 

30 cells (3D7 SM and 3D7 en dc>; Example 3, 4, 5, 6 and 7). All the 3D7 var genes differ from each 
other, but smaller blocks of sequences with high similarity are found in various var genes. 
To date, only three sub-families of PfEMPl have been defined (varl-3) (Salanti et al. 2002; 
Salanti et al. 2003; Kraemer and Smith, 2003). Apart from the varl-3 sub-families, all 
PfEMPl genes described so far from other parasite isolates differ from each other, and 

35 from the 3D7 var genes. It has therefore been assumed that the global repertoire of var 
genes is very large. This constitutes an obvious obstacle for the development of vaccines 
based on var genes and their products, as a high degree of conservation is a prerequisite 
for vaccine pan-reactivity. 

40 To test the degree of inter-genomic diversity of PFD1235w/MAL7Pl.l(SEQ ID NO.: 1), 
PF11_0008 (SEQ ID NO.: 3) and PF13_0003 (SEQ ID NO.: 5); 60 different Ghanaian 
parasite isolates obtained from the peripheral blood of P. falciparum malaria patients were 
tested. 
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ml 0-1. Genomic DNA was isolated ( htt p : //www . clo ntech .com ) using the NucleoSpin 

purification kits according to the manufacturer's recommendations. PCR was carried 
out in 0.2-ml microfuge tubes in a reaction volume of 20 jal using a PE2400 PCR 
5 machine ( http://www.perkin-elmer.com ). Final concentrations of the PCR reagents 

were as follows: HotstartTaq polymerase ( http://www.qiaqen.com ): 0.1 U; 
primers: 1 pM; dNTP: 2.5 mM, each; and MgCl 2 : 1.5 mM). Cycling conditions were 
optimised for P. falciparum DNA: 15 min at 95°C followed by 30 cycles of 30 sec at 
94°C, 30 sec at 50°C, and 4 min at 68°C, with a final extension for 15 min at 68°C. 
10 The PCR products were visualised and size was determined in a 1% agarose gel 

containing EB. PCR amplification using PFD1235w/MAL7Pl.l, PF11_0008, and 
PF13__0003 specific primers (Table 1) on genomic DNA from 60 isolates from 
Ghanaian children yielded a definite band of the expected size of 160 bp in 10, 3 7 
20, respectively of the isolates. 

15 

m!0-2.To demonstrate the extent of sequence similarity, 2304 bp corresponding to 768 
amino acids were cloned and sequenced. Gene-specific primers for 
PFD1 235W/MAL7P1 . 1 (SEQ ID NO.: 1) were used to perform PCR on genomic DNA 
from PCR positive isolates. PCR products were gel-purified using the Qiagen gel 

20 purification kit according to the manufacturer's instructions 

( http://www.qiaqen.com ). Purified PCR fragments were ligated into the pCRII TOPO 
vector using TOPO TA cloning kit, and transformed into TOP10 competent cells 
f http://www.invitroqen.corrO . Positive clones were selected and propagated. 
Plasmid preparations were made using MiniPrep spin columns 

25 f http://www.qiaaen.corrO . Sequencing was performed on an ABI Prism 377 using 

the Big Dye terminator reaction mix ( http://www.perkin-elmer.com l. Proofreading 
and translation were done with ABI Prism software. 

From the sequencing it was found that 556/768 cloned from BM021 and BM048 were 
30 identical to the PFD1 235w/MAL7Pl . 1 (SEQ ID NO.: 1) sequence (Fig. 19). Alignments were 
done using ClustalW and default settings. 

Taken together, these data show that PFD1235w/MAL7Pl.l (SEQ ID NO.: 1) belongs to a 
conserved and common gene sub-family and thus fulfils two required criteria for any 
35 candidate gene in vaccine development. Similarly PF11„0008 (SEQ ID NO.: 3) and 

PF13_0003 (SEQ ID NO.: 5) were found to belong to var gene sub-families families with 
similar identities. 

Example 11: 

40 3D7 S m expresses VAR4 (SEQ ID NO.: 2) on the surface of infected erythrocytes 

To generate recombinant proteins of VAR4 (SEQ ID NO.: 2) domains were cloned into the 
pBAD-TOPO vector (Invitrogen) by PCR using the following domain-specific oligonucleotide 
primers and subsequently into the Baculovirus transfer vector pAcGP67: 
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NTS.Fw: 


5' 


- G A ATTC ATG G G G A ATG C ATC ATC A- 3 ' 


NTS.Rv: 


5' 


- ATTTCTATCTTCTG CGTT- 3 ' 


DBLl-a.Fw: 


5' 


-GAATTCAACGCAGAAGATAGAAATC-3 ' 


DBLl-a..Rv: 


5' 


- CTTCG ATGTATATGTCTG - 3 ' 


CIDRl-a.Fw: 


5' 


-G A ATTCG ACG CTA A A ACTG ATAGTA- 3 ' 


CIDRl-a.Rv: 


5' 


- ACATATATCG CC ATTCA ACG - 3 ' 


DBL2-p.Fw: 


5' 


- G A ATTCTTG A ATG G CG ATATATGTA- 3 ' 


DBL2-p.Rv: 


5' 


- GTATATAG CTG ATA CTGT- 3 ' 


C2.Fw: 


5' 


-GAATTCACAGTATCAGCTATATACC-3 ' 


C2.Rv: 


5' 


-ATAACATGGTTGTTGTGA-3 ' 


DBL3-p„Fw: 


5' 


-GAATTCTCACAACAACCATGTTATG-3 ' 


DBL3-p..Rv: 


5' 


-ATCTATTCCACCTGTAGT-3 ' 


DBL4-y.Fw: 


5' 


- G A ATTC ACTAC AG GTG G AATAG ATC A- 3 ' 


DBL4-y.Rv: 


5' 


-ACATGCGGCATTGAGACT-3 ' 


DBL5-S.Fw: 


5' 


- G AATTCAGTCTC A ATG CCG C ATGTG - 3 ' 


DBL5-5.Rv: 


5' 


-TCTACAATGTCTGGCACA-3 ' 


CIDR2-p.Fw: 


5' 


- G AATTCTGTG CCAG ACATTGTAG ATC- 3 ' 


CIDR2-p.Rv: 


5' 


-III G CCACTAG GTACGT- 3 ' 



mll-1 

5 For production of carboxy-terminally V5 epitope and histidine-tagged protein the PCR 
amplified inserts were excised from pBAD-TOPO constructs by EcoRI and Pmel digestion 
and subsequently sub-cloned into the EcoRI and blunt-ended Sg/II sites of the Bacuiovirus 
transfer vector pAcGP67-A (BD Biosciences, Br0ndby, Denmark). Recombinant Bacuiovirus 
were generated by co-transfection of the 

10 pAcGP307-A-NTS,pAcGP67-A-DBLl-a, pAcGP67-A-CIDR15-a, pAcGP67-A-DBL2-p, pAcGP67- 
A-C2 / pAcGP67-A-DBL3-p, pAcGP67-A-DBL4-y, pAcGP67-A-DBL5-5, or pAcGP67-A-CIDR2-pD 
construct genes and Bsu330l linearized Bakpak6 Bacuiovirus DNA (BD Biosciences 
Clontech) into insect Sf9 cells. Recombinant products were expressed by infection of insect 
High-Five cells with recombinant Bacuiovirus using a multiplicity of infection often. 

15 Recombinant proteins were purified from culture supernatants on Co2+ metal-chelate 
agarose column and eluted with 25 mM HEPES-KOH (pH=7.30), 0.5 mM MgCI 2/ 0.5 mM 
DTT, 100 mM NaCI, 10% glycerol, and 100 mM imidazole. 

ml 1-2 

20 For production of GST fusion protein a conserved sequence of the intracellular acidic 

segment ATS/Exon2 was sub-cloned into the pGEX-4Tl vector by PCR usingthe following 
oligonucleotide primers ATS - Fw : 5 '-CGGAATTCA A A AC A A AATC ATCAGTAG - 3 ATS-Rv: 
5 '- A TAA GAA TG CG G CCGC GTTGATTACCACTTA ATGTG - 3 ' . The proteins were expressed as 
fusion proteins at the carboxy-terminus of glutathione S-transferase from Schistosoma 

25 japonicum and purified by affinity chromatography on glutathione sepharose 4B 
(Amersham Pharmacia Biotech). 
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mil -3 

To generate murine antibodies against VAR4 domains and ATS/Exon2 the recombinant 
proteins were used to immunize Baib/c mice (5 |jg) and rabbit s (50 jjg) (given 
subcutaneously in Freund's complete adjuvant) followed by two booster injections in 
5 Freund's incomplete adjuvant. 

A DNA vaccination approach to generate antibodies to VAR4 (SEQ ID NO.: 2) domains 
were also used. All domains was cloned into the Eucaryotic TA expression vector pCR3.1 
(Invitrogen) using the following primers: 

10 

DBLl-a.Fw: 5 ' -GCCRCCATGGACGCAGAAGATAGAAATC-3 ' 
DBLl-a.Rv: 5 ' -CTACTTCG ATGTATATGTCT- 3 ' 
CIDRl-a.Fw: 5 '-GCCRCCATGGACGCTAAAACTGATAGTA-3 ' 
CIDRl-a.Rv: 5 ' -CTAACATATATCGCCATTCAACG-3 ' 
DBL2-p.Fw: 5 ' - G CC RCC ATG GTG A ATG G CG ATATATGTA- 3 ' 
DBL2-p.Rv: 5 ' - CTA GTATATAG CTG ATACTGT- 3 ' 
C2. Fw: 5 ' -GCCRCCATGGCAGTATCAGCTATATACC-3 ' 

C2 . Rv : 5 ' -CTAATAACATGGTTGTTGTG A-3 ' 

DBL3-p.Fw: 5 '-GCCRCCATGGCACAACAACCATGTTATG-3 ' 
DBL3-p..Rv : 5 ' -CTAATCTATTCCACCTGTAGT-3 ' 
DBL4-y.Fw : 5 ' -GCCRCCATGGCTACAGGTGGAATAGATCA-3 ' 
DBL4-y.Rv: 5 ' - CTA ACATG CG G C ATTG AG ACT- 3 ' 
DBL5-5.Fw: 5 ' - G C C RCC ATG G GTCTC A ATG CC G C ATGTG - 3 ' 
DBL5-5.Rv: 5 ' -CTATCTACAATGTCTGGCACA-3 ' 
CIDR2-p.Fw: 5 ' -GCCRCCATGGGTGCCAGACATTGTAGATC-3 ' 
CIDR2-p.Rv: 5 '-CTATTTGCCACTAGGTACGT-3 ' 

mll-4. Plasmids were propagated in TOP10 cells (Invitrogen) and plasmid was purified 
using Plasmid GIGA prep kit (Qiagen). Plasmid DNA was injected IM to mice 4 
times with 2 weeks intervals and finally boosted with the recombinant protein 

15 corresponding to the domain. 

ml 1-5. SDS-extracted trophozoite/schizont-stage iRBC were reduced by boiling in the 

presence of L-mercaptoethanol and electrophoresed in the Laemmli sodium dodecyl 
sulfate polyacrylamide gel electrophoresis (SDS-PAGE) system (Sambrook et al., 
1989). Immunoblots were prepared on PVDF membranes by semidry blotting using 

20 standard methods. All available binding sites on membranes were blocked in Tris- 

buffered-saiine-Tween (TBS-T) containing 5% skimmed milk. Blots were probed 
with an antiserum raised against a recombinant protein of the highly conserved 
intracellular acidic segment ATS/Exon2 conserved among most var genes in 3D7, 
CIDRl-ct and DBL5-8 of PFD1235w/MAL7Pl.l (SEQ ID NO.: 1), and pre-immune 

25 mouse sera diluted 1:100 in TBS-T. Bound antibody was detected with relevant IgG 

alkaline phosphatase-conjugated antibody (Dako, Giostrup, Denmark). All washes 
were done using TBS-T. Blots were developed by standard methods using p- 
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nitroblue tetrazolium phosphate (NBT, Sigma) and 5-bromo-4-chloro-3- 
indolylphosphate (BCIP, Sigma) as substrates. 
mll-6. Immunostaining and flow cytometry were performed as described (Staalsoe et al., 
1999; Nielsen et al., 2002) with some modifications. Briefly, 2.5 x 10 5 of MACS 
5 purified, ethidium bromide labeled iRBC were incubated for 1 hr in 20 \x\ of murine 

sera or for 30 min in 5 uL of human sera. All murine sera had been depleted of 
anti-human erythrocyte antibodies by absorption. For immunostaining with murine 
sera, iRBC were sequentially exposed to 100 \xL of 1:25 diluted goat anti-mouse Ig 
(Dako), biotinylated anti-goat Ig (Dako) and 1:200 diluted FITC-conjugated 

10 streptavidin (Dako) for 30 minutes each. For immunostaining with human sera, 

iRBC were incubated in 100 uL of 1:25 diluted biotinylated anti-human IgG (Dako) 
then in 1:2000 diluted FITC-conjugated streptavidin for 30 min each time. For 
fluorescence and confocal microscopy, wet mounts of immunostained parasites 
were prepared and images were obtained using a Leica DM LB2 and a Carl Zeiss 

15 Scanning Microscope, respectively. For both microscopes, suitable filters and 

channels were used to detect FITC and ethidium-bromide staining. 
When appropriate, trypsin treatment was performed as described (Fernandez et al., 
1999) with modifications. Cells were washed once in PBS, incubated in 10 volumes 
of 100 {ig/ml TPCK-treated trypsin (Amersham Pharmacia Biotech) in PBS for 10 

20 min at 37°C. The reaction was stopped with 1 volume of 2 mg/ml soybean trypsin 

inhibitor (Sigma-Aldrich) in RPMI 1640/5% Albumax. Cells were washed twice in 
PBS then used for immunostaining and flow cytometry. 

We next used Western blotting to investigate the protein translation of VAR4 (SEQ ID NO.: 

25 2) (mll-5). Analysis of PfEMPl expression of unselected late trophozoite/schizont stage 
3D7 using antiserum against the conserved intracellular ATS/Exon2 var domain showed a 
single band with an estimated molecular weight of 260 kDa (Fig. 16, Lane A). This 
indicates that most of these parasites expressed a PfEMPl with a four-domain structure, in 
correspondence to the results of the absolute quantification (Example 6, Table 2) that 

30 showed a dominant group C gene transcript encoding a PfEMPl molecule of this size. 
Antibody-selected 3D7 expressed a PfEMPl species of 260 kDa, but also expressed 
additional high-molecular weight bands (Fig. 16, Lane B). Again, these data agree with the 
results of the absolute quantification showing that the three most highly transcribed var 
genes in antibody-selected 3D7 were PF08_0107, MAL6P1.316, and more specifically 

35 PFD1235w/MAL7Pl.l (SEQ ID NO.: 1), which encode proteins with predicted molecular 
weights of approximately 260, 330, and 400 kDa, respectively. Furthermore, an antiserum 
against Baculovirus-defwed DBL5-5 of the PFD1235w/MAL7Pl.l (SEQ ID NO. : 1) gene 
product revealed a high-molecular weight band (Fig. 16, Lane D) corresponding to the top 
band in the ATS/Exon2-probed blot (Fig. 16, Lane B). An identical result was obtained with 

40 antiserum against Bacufovirus-derived CIDRl-cc of the PFD1235w/MAL7Pl.l (SEQ ID NO.: 
1) (data not shown). This indicates that PFD1235W/MAL7P1.1 (SEQ ID NO.: 1) is not only a 
major transcript in antibody-selected 3D7, but that this var gene is also translated into 
protein. 



WO 2005/063804 



PCT/DK2004/000922 



97 

Antibodies to Baculovirus-tierlved DBL5-5 of the PFD1235w/MAL7Pl.l (SEQ ID NO. : 1) 
gene product were tested on iRBC by flow cytometry, immuno-fluorescence and confocal 
microscopy (mll-6). Only a small percentage of RBC infected by unselected 3D7 reacted 
with the murine DBL5-5 antibodies (Fig. 17), although these iRBC did express VSA 
5 detectable by flow cytometry with human plasma (Fig. 17A3). By contrast, a large 
proportion of RBC infected with antibody-selected 3D7 were recognized by the murine 
antibodies, and this reactivity was abrogated by prior trypsin treatment of the iRBC (Fig. 
17B1). Fluorescence and confocal microscopy using the DBL5-8 murine antibodies showed 
a distinct punctate pattern on the surface of intact RBC infected by antibody-selected 3D7 
10 (Figs. 17B2, 17B5), but not by unselected 3D7 (Figs. 17A2, 17A5). An identical result was 
obtained with antiserum against Baculovirus-derlved CIDRl-a of the PFD1235w/MAL7Pl.l 
(SEQ ID NO.: 1) (data not shown). 

Taken together, these results indicate that the product of PFD1235W/MAL7P1.1 (SEQ ID 
15 NO.: 1 and 2) is expressed on the surface of antibody-selected 3D7 and that the protein 
confers a VSA SM phenotype. Similar results were obtained for PF11_0008 (SEQ ID NO.: 3 
and 4) and PF13_0003 (SEQ ID NO.: 5 and 6) (data not shown). 

Example 12: 

20 Monoclonal antibodies to VAR4 (SEQ ID NO.: 2), VAR5 (SEQ ID NO.: 4), VAR6 (SEQ ID 
NO.: 6) 

For production of monoclonal antibodies (mAb) mice were immunized by injection of 
different domains of VAR4, VAR5, and/or VAR6 to stimulate the production of antibodies 
25 targeting these domains. 

ml2~l. The antibody forming cells were isolated from the spleen of the mice. Monoclonal 
antibodies were produced by fusing single antibody-forming ceils to cancer cells 
(such as cells from myeloma) to make them immortal. The cells were grown using 
30 in vitro cell-culture techniques and cloned by limiting dilution. mAb secreted from 

the hybridomas were purified and used for identification by flow-cytometry of field 
P. falciparum isolates expressing VAR4, VAR5, and/or VAR6 or homologous hereof. 

This strategy resulted in mAb with different specificities and different avidity and the 
35 resulting mAb were used to identify epitopes involved in adhesion to receptors on human 
endothelial cells and to discriminate between even quite similar epitopes. 



Example 13: 

VAR4 (SEQ ID NO.: 2) recombinant fusion proteins are commonly and specifically 
40 recognised 
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If VAR4 (SEQ ID NO 2) mediates a VSA SM phenotype, It would be predicted that a high 
proportion of children in malaria endemic areas have acquired antibodies to this protein. 
To test this hypothesis we measured plasma levels of IgG against recombinant domains of 
the VAR4 protein by ELISA in asymptomatic individuals living under high malaria 
5 transmission intensity in Tanzania. 

To make recombinant proteins of VAR4 (SEQ ID NO.: 2) the PFD1235w/MAL7Pl.l (SEQ ID 
NO.: 1) NTS, DBLl-oc, CIDRl-a, DBL2-p, C2, DBL3-(3, DBL4-y, DBL5-5, and CIDR2-p were 
sub-cloned into the pGEX-4Tl vector or Gateway system by PCR using the following 
10 domain-specific oligonucleotide primers and a hot start taq polymerase (Qiagen) and 
PfuTurbo (Stratagene): 

pGEX primers: 

NTS. Fw: 5 ' - G AATTC ATG GG G A ATGC ATC ATC A- 3 ' 

NTS . Rv : 5 ' - ATA AG A ATG CG G CCG C ATTTCTATCTTCTG CGTT- 3 ' 

DBLl-a.Fw: 5 ' -GAATTCAACGCAGAAGATAGAAATC-3 ' 

DBLl-a..Rv: 5 ' -ATAAGAATGCGGCCGCCTTCGATGTATATGTCTG-3 ' 

CIDRl-a.Fw: 5 '-GAATTCGACGCTAAAACTGATAGTA-3 ' 

CIDRl-a.Rv: 5 '-ATAAGAATGCGGCCGCACATATATCGCCATTCAACG-3 ' 

DBL2-p.Fw : 5 ' - G A ATTCTTG A ATG G CG ATATATGTA- 3 ' 

DBL2-p.Rv: 5 '-ATAAGAATGCGGCCGCGTATATAGCTGATACTGT-3 ' 

C2 . Fw : 5 ' -G A ATTCACAGTATCAG CTATATACC- 3 ' 

C2 . Rv : 5 ' -ATAAGAATGCGGCCGCATAACATGGTTGTTGTGA-3 ' 

DBL3-p„Fw: 5 ' -G A ATTCTCAC A ACAACCATGTTATG - 3 ' 

DBL3-p..Rv: 5 ' -ATAAGAATGCGGCCGCATCTATTCCACCTGTAGT -3 ' 

DBL4-y.Fw : 5 ' - G AATTC ACTA CAG GTG G A ATAG ATC A- 3 ' 

DBL4-y.Rv : 5 ' -ATAAGAATGCGGCCGCACATGCGGCATTGAGACT-3 ' 

DBL5-5.FW : 5 ' - G A ATTC AGTCTC AATG CCG C ATGTG - 3 ' 

DBL5-5.Rv: 5 ' -ATAAGAATGCGGCCGCTCTACAATGTCTGGCACA-3 ' 

CIDR2-p.Fw: 5 ' -TCCCCCGGGTGTGCCAGACATTGTAGATC-3 ' 

CIDR2-p.Rv: 5 ' -ATAAGAATGCGGCCGCTTTGCCACTAGGTACGT-3 ' 

15 Gateway primers: 

5 ' -GGGGACAAGTTTGTACAAAAAAGCAGGCTTGATGGGAATGCATC ATCA-3 ' 

NTS.Fw: 

NTS. Rv : 5 ' -GGGGACCACT TTGTACA AG AA AG CTG G GTCTAG ATTTCTATCTTCTG CGT- 3 ' 
DBLl-a. Fw: 5' -GGGGACAAGTTTGTACAAAAAAGCAGGCTTGAACGCAGAAGATAGAAATC-3 ' 
DBLl-a. Rv: 5 ' - G G G G AC A AGTTTGTACA A A A A AG CAG G CTTG AACG CAG A AG ATAG AA ATC- 3 ' 
CIDRl-a.Fw: 5 ' -GGGGACAAGTTTGTACAAAAAAGCAGGCTTGGACGCTAAAACTGATAGTA-3 ' 
CIDRl-a.Rv: 5 ' -GGGGACCACTTTGTACAAGAAAGCTGGGTCTAATATATCGCCATTCAACG-3 ' 
DBL2-p.Fw : 5 ' -GGGGACAAGTTTGTACAAAAAAGCAGGCTTG TTG A ATG G CG ATATATGTA-3 ' 
DBL2-p.Rv: 5 ' -GGGGACCACTTTGTACAAGAAAGCTGG GTCTAG GTATATAGCTG ATA CTG-3 ' 
C2 . Fw : 5 ' -GGGGACAAGTTTGTACAAAAAAGCAGGCTTG ACAGTATCAGCTATATACC-3 ' 
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C2.Rv: 5 
DBL3-0.Fw: 5 
DBL3-(3.Rv: 5 
DBL4-y.Fw: 5 
DBL4-y.Rv: 5 
DBL5-5.Fw: 5 
DBL.5-5.Rv: 5 
CIDR2-p.Fw: 5 
CIDR2-p.Rv: 5 



-GGGGACCACTTTGTACAAGAAAGCTGGGTCTACATAACATGGTTGTTGTG-3 ' 
-GGGGACAAGTTTGTACAAAAAAGCAGGCTTG TCACAACAACCATGTTATG-3 ' 
-GGGGACCACTTTGTACAAGAAAGCTGGGTCTATGATCTATTCCACCTGTAG-3 ' 
-GGGGACAAGTTTGTACAAAAAAGCAGGCTTG ACTACAGGTGGAATAGATCA-3 
-GGGGACCACTTTGTACAAGAAAGCTGGGTCTACACATGCGGCATTGAGAC-3 ' 

' -GGGGACAAGTTTGTACAAAAAAGCAGGCTTGAGTCTCAATGCCGCATGTG-3 ' 
-GGGGACCACTTTGTACAAGAAAGCTGGGTCTAATCTACAATGTCTGGCAC-3 
-GGGGACAAGTTTGTACAAAAAAGCAGGCTTG TGTGCCAGACATTGTAGATC-3 ' 

' -GGGG ACCACTTTGTACAAGAAAGCTGGGTCTAGTTTGCCACTAGGTACGT-3 ' 



The proteins encoding single domains were expressed as fusion proteins (E. coif strain 
BL21) at the carboxyterminus of glutathione S-transferase from Schistosoma japonicum, 
and purified by affinity chromatography on glutathione sepharose 4B (Amersham 
5 Pharmacia Biotech) 



Materials and methods 

ml3-l. We used plasma samples from 20 children (3-4 and 10-11 years of age) and 10 

10 young adults (18-19 years of age) living in Mgome village in the Tanga region of 

Tanzania for ELISA analysis of antibody responses to purified recombinant NTS, 
DBL3-p, CIDRl-aand DBL5-5 of VAR4-GST domains as previously described. 
Briefly, proteins were diluted in 0.1 M glycine/HCI (pH 2.75). The wells of Maxisorp 
micro titre plates (Nunc, Roskilde, Denmark) were coated with antigen by overnight 

15 incubation at 4°C. The plates were emptied, and any residual binding capacity was 

blocked with 100 \x\ of blocking buffer (1% bovine serum albumin, 0.5 M NaCI, 1% 
Triton-X-100 in phosphate-buffered saline (PBS), pH 7.2) per well. After incubation 
for 0.5 hr at room temperature, the plates were washed four times with washing 
buffer (PBS, 0.5 M NaCI, 1% Triton-X-100, pH 7.4) and 100 ix\ of plasma diluted 

20 1:200 in blocking buffer was added to each well. The plates were then incubated 

for one hour at room temperature, and then washed and incubated for one more 
hour at room temperature with peroxidase-conjugated rabbit anti-human 
immunoglobulin G (IgG) (Dako, Glostrup, Denmark) diluted 1:1000 in blocking 
buffer. Subsequently, the plates were washed and 100 \x\ of o-phenylenediamine 

25 substrate (0.30%, Dako) diluted in 0.1 M sodium citrate buffer (pH 5.0) with 

0.05% (v/v) H 2 0 2/ was added to each well. Finally, the plates were incubated at 
room temperature in the dark before the addition of 100 \x\ of 2.5 M H 2 S0 4 and the 
optical density (OD) was measured at 492 nm. Control plates were coated with GST 
alone. Cut-off values were calculated as the mean ELISA unit plus 2 standard 

30 deviations obtained with sera of 13 Danish blood donors without malaria exposure 

as described in (Jakobsen et al., 1993). 



ml3-2. Competition ELISA was done using recombinant CIDRl-a domains of 

VAR4 (SEQ ID NO.: 2) and varl (PFE1640w) using three different plasma samples 
35 of high, medium, and low reactivity with the proteins from two children aged 3 
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years and one child aged 11 years. Blocking of plasma was done using 0.1, 1, 5, 
and 10 jag/ml of recombinant protein for 2 hr at room temperature. The test 
plasma samples were diluted 1:50 and tested as described above. 

5 Most parasite-exposed children and adults had IgG directed against DBL5-5 (Fig. 18A), 
CIDRl-ct (Fig. 18B), DBL3-J3 (Fig. 18C), and NTS (Fig. 18D), with comparable levels in 
young children and adults. To test the antigen specificity of the antibody recognition, we 
performed competition ELISA using the CIDRl-a domains of the VAR4 (SEQ ID NO.: 2) 
and varl (PFE1640w) proteins. Pre-incubation of the plasma with homologous recombinant 
10 protein caused a dose-dependent reduction in OD 492 values (Figs. 18E, 18F), whereas the 
heterologous protein (Figs. 18E, 18F) did not change the plasma reactivity. 

These result showing that a high proportion of young children recognises recombinant 
domains of VAR4 and the demonstration that our DBL5-5 antiserum did not cross react 

15 with products of MAL6P1.316, and PF08_0107 in Western blotting, suggest that the plasma 
antibody reactivity to recombinant VAR4 (SEQ ID NO.: 2) protein was the result of 
exposure to parasites expressing PfEMPl resembling that encoded by PFD1235w/MAL7Pl.l 
(SEQ ID NO.: 1), rather than being due to a broad cross-reactivity between different CIDR 
or DBL domains. Similar results were obtained when using VAR5 (SEQ ID NO. : 4) and 

20 VAR6 (SEQ ID NO.: 6) (data not shown). 

Example 14: 

Experimental infection of humans results in transcription and transiation of 
PFD1235W/MAL7P1.1 (SEQ ID NO.: 1 and 2) 

25 

We have proposed that the dominance of parasites expressing VSA SM -type antigens among 
non-immune patients is related to their higher growth rate in such individuals, and that the 
shift towards VSA UM -type antigens occurs as this strong selective advantage of VSA S m" 
expressing parasites gradually disappears as VSA S M-specific immunity is acquired. 

30 

Materials and methods: 

ml4-l. We used parasites isolated on days 8, 9, and 10 from a Dutch volunteer exposed 
on day 0 to mosquitoes infected by P. falciparum isolate NF54 as part of ongoing 
35 studies of experimental P. falciparum infections (Hermsen et ai., 2001). These 

parasites were cultured in vitro for 27 (day 8 and day 9 isolates) or 33 days 
(day 10 isolate) to obtain sufficient parasites for DNA/RNA analysis. 

ml4-2. SDS-extracted trophozoite/schizont-stage iRBC were reduced by boiling in the 
40 presence of L-mercaptoethanol and electrophoresed in the Laemmli sodium dodecyl 

sulfate polyacrylamide gel electrophoresis (SDS-PAGE) system (Sambrook et al., 
1989). Immunoblots were prepared on PVDF membranes by semidry blotting using 
standard methods. All available binding sites on membranes were blocked in Tris- 
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buffered-saline-Tween (TBS-T) containing 5% skimmed milk. Blots were probed 
with an antiserum raised against a recombinant protein of the highly conserved 
intracellular acidic segment ATS/Exon2 conserved among most var genes in 3D7, 
DBL5-5 of VAR4 (SEQ ID NO.: 2), and pre-immune mouse sera diluted 1:100 in 
5 TBS-T. Bound antibody was detected with relevant IgG alkaline phosphatase- 

conjugated antibody (Dako, Glostrup, Denmark). All washes were done using TBS- 
T. Blots were developed by standard methods using p-nitroblue tetrazolium 
phosphate (NBT, Sigma) and 5-bromo-4-ch!oro-3-indolyiphosphate (BCIP, Sigma) 
as substrates. 

10 

To further study this hypothesis we studied parasites rescued on days 8, 9, and 10 from a 
Dutch volunteer receiving a mosquito-transmitted P. falciparum NF54 infection on day 0. 
NF54 was originally isolated from a non-immune Dutch malaria patient, and isogenic with 
the 3D7 parasites cloned from it ( Salanti et al., 2003). Western blots of Day 10 parasites 

15 probed with antiserum against the DBL5-5 domain of the PFD1235w/MAL7Pl.l gene 

product, detected a high-molecular weight band (Fig. 16, Lane G), which corresponded in 
size to that observed in antibody-selected 3D7 (Fig. 16, Lane B). This band was not 
detected in similar blots of parasites obtained on Day 8 and Day 9 (Fig. 16, Lanes E-F). In 
line with these findings, we observed high PFD1 235w/MAL7Pl . 1 (SEQ ID NO.: 1) copy 

20 numbers relative to those of the seryl tRNA synthetase gene on Day 10, but not on Day 8, 
corresponding to a 38.30 and 8.4 fold increase in mRNA copies of this var gene relative to 
the copy number of seryl-tRNA synthetase mRNA between Day 8 and Day 10 in rings and 
trophozoites, respectively (data not shown). Judged by comparison of Ct-values, ring- 
stage transcription of PFD1235w/MAL7Pl . 1 (SEQ ID NO.: 1) was relatively low on Day 8 

25 (rank: 55/60), but high on Day 10 (rank: 9/60). The corresponding rankings in 
trophozoites were 48 on Day 8 and 18 on Day 10. 

Taken together, these data show that PFD1235w/MAL7Pl.l (SEQ ID NO.: 1) is translated 
into a functional protein in vivo, and indicate that this protein may be associated with fast- 
30 growing parasites dominating acute P. falciparum malaria in non-immune patients. Similar 
results were obtained for PF11„0008 (SEQ ID NO.: 3) and PF13„0003 (SEQ ID NO.: 5) 
(data not shown). 



Example 15: 

35 

Anti-adhesion assay 

It is becoming increasingly apparent that acquired protective immunity to P. falciparum 
infection relies on antibodies (Abs) specifically recognizing variant parasite antigens 
expressed on the surface of late stage-infected erythrocytes. In this scenario, only 
40 parasites expressing variant antigens to which the host does not possess adequate specific 
Ab are likely to cause disease, and immunity is likely to depend on the accumulation of a 
large panel of Ab specificities recognizing different variants of such antigens. Severe 
malaria is often associated with sequestration of large quantities of parasites in the brain. 
Parasites causing severe malaria have been shown to adhere receptors on endothelial 
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ceils. To show that VAR4, VAR5, and VAR6 are responsible for in vitro adhesion of 3D7 SM i 
and 3D7 en do parasites to endothelial receptors, an antibody adhesion assay with murine 
antibodies against VAR4, VAR5, and VAR6 were performed. 

5 Materials and methods: 

ml5-l. Antiadhesion was measured by 3 H labeled parasites: For use in adhesion assays, 
parasite cultures with a parasitemia of ™1% late trophozoites and schizonts were 
first transferred from Albumax II medium (Life Technologies), with a high 
concentration of hypoxanthine (Hpx), into RPMI 1640 plus 5% normal human 

10 serum (low Hpx) and maintained for 24 h. The parasites then were labeled by 

exposure to [3H]Hpx (Amersham; 8.75 MBq/mL of RBCs) for another 24 h. Finally, 
the cultures were enriched for late-stage iRBCs and incubated for 30 min, with or 
without test plasma. Endothelial cells were grown on Microtiter plates (Falcon; 
Becton Dickinson) then blocked with bovine serum albumin (BSA; 20 mg/mL, 100 

15 iaL/well) in PBS at room temperature for 30 min. We added enriched [ 3 H]Hpx- 

labeled late-stage iRBCs to wells containing endothelial cells (2 x 10 6 cells/well) 
and incubated the wells at 37°C for 1 h. Nonadherent iRBCs were removed by 4 
washes in RPMI 1640. Adherent iRBCs were harvested onto glass fiber pads, and 
the [ 3 H]Hpx activity was measured in a liquid scintillation counter (Beckman 

20 Coulter). Inhibition of iRBC adhesion by plasma was calculated as 1 - 

(testEndothelialCell - controlBSA)/controlEndothelialCeil - controlBSA), where 
testEndothelial is counts per minute of iRBCs preincubated with plasma and 
adhering to wells containing endothelial cells, and controlEnothelial cells and 
controlBSA refer to counts per minute of iRBCs not preincubated with plasma and 

25 adhering to endothelial cells and BSA-coated wells, respectively. 

Cytoadhesion of endothelial cells were significantly inhibited by plasma from individuals 
suffering from severe malaria, and more importantly binding of 3D7 SM1 and 3D7 en do to 
endothelial cells were strongly inhibited by the murine anti-VAR4, -VAR5, and -VAR6 
30 antibodies. Antibodies raised against recombinant VAR4, -VAR5, and -VAR6 inhibit parasite 
adhesion to endothelial cells in vitro (data not shown). 

An obvious consequence of this finding is that vaccine induced antibodies against SEQ ID 
NO.: 2, 4, and/or 6 constructs can hinder binding of parasites to endothelial tissue and 
35 thus prevent severe malaria. 



Example 16: 

Identification of receptors and VAR4 (SEQ ID NO.: 2), VAR5 (SEQ ID NO.: 4), and/or VAR6 
(SEQ ID NO.: 6) sites binding receptors on endothelial cells in vitro 
40 To further study endothelial cell adhesion all domains were cloned into the pDISPi-AY 
vector (Invitrogen) using the following primers as examplified for VAR4: 



NTS.Fw: 



5 ' -TCCCCCGGGATGGGGAATGCATCATCA-3 ' 
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NTS . Rv : 5 ' -TCCC CG C G G ATTTCTATCTTCTG CGTT- 3 ' 
DBLl-a.Fw: 5 '-TCCCCCGGGAACGCAGAAGATAGAAATC-3 ' 
DBLl-a..Rv: 5 '-TCCCCGCGGCTTCGATGTATATGTCT-3 ' 
CIDRl-a.Fw: 5 ' -TCCCCCGGGGACGCTAAAACTGATAGTA-3 ' 
CIDRl-a.Rv: 5 '-TCCCCGCGGACATATATCGCCATTCAACG-3 ' 
DBL2-p.Fw: 5 ' -TCCCCCGGGTTGAATGGCGATATATGTA-3 ' 
DBL2-p.Rv: 5 ' -TCCCCG CG GGTATATAG CTG ATA CTGT- 3 ' 
C2. Fw : 5 ' -TCCCCCGGGACAGTATCAGCTATATACC-3 ' 

C2.Rv: 5 ' -TCCCCG CG G ATA ACATG GTTGTTGTG A- 3 ' 

DBL3-p..Fw: 5 ' -TCCCCCG G GTCAC AAC AA CC ATGTTATG - 3 ' 
DBL3-p..Rv: 5 ' -TCCCCG CGGATCTATTCCACCTGTAGT-3 ' 
DBL4-y.Fw: 5 ' -TCCCCCGGGACTACAGGTGGAATAGATCA-3 ' 
DBL4-y.Rv: 5 ' -TCCCCGCGGACATGCGGCATTGAGACT-3 ' 
DBL5-5.Fw: 5 ' -TCCCCCG GGAGTCTCAATGCCGCATGTG- 3 ' 
DBL5-5.Rv: 5 ' -TCCCCG CGGTCTACAATGTCTGGCACA- 3 ' 
CIDR2-p.Fw: 5 '-TCCCCCGGGTGTGCCAGACATTGTAGATC-3 ' 
CIDR2-p.Rv: 5 ' -TCCCCG CGGTTTGCCACTAGGTACGT- 3 ' 

The ability of the different domains to bind directly to endothelial cells was in this example 
assayed using a mammalian expression system. 

5 ml 6-1. Domains were cloned into the pDisplay vector (Invitrogen). This vector allows 
display of cloned proteins on the ceil surface. Each domain was fused at the N- 
terminus to the murine Ig K-chain leader sequence, which targets the protein to the 
cell surface, and at the C-terminus to the platelet derived growth factor receptor 
(PDGFR) transmembrane domain, which anchors the protein to the cell membrane. 

10 

A human non-adherent T ceil and a CHO cell line was used for transient expression of the 
recombinant proteins. This approach has enabled us to study cell adhesion to endothelial 
cells. Blockage of this binding was studied using relevant recombinant domains expressed 
in the Baculovirus system (Example 11) and commercial antibodies directed against 
15 specific endothelial receptors. 

VAR4 (SEQ ID NO.: 2), VAR5 (SEQ ID NO. : 4), and/or VAR6 (SEQ ID NO.: 6) regions 
responsible for receptor binding were identified using random mutagenesis. 



Example 17: 

20 To express VAR4 (SEQ ID NO.: 2) in eucaryotic organisms the Exon 1 ranging from nt 1 to 
9443 (SEQ ID NO.: 1 ) was subjected to a fail recodonisation. 

An artificial codon table was generated by combining the codon usage of Spodoptera 
frugiperda and Homo sapiens genes. The codon bias of the synthetic genes were adapted 
25 to this "artificial" codon usage. In addition, regions of very high (> 80%) or very low 

(<30%) GC content was avoided and the GC-content was adjusted to 50% where possible. 
During the optimization process following cis-acting sequence motifs were avoided: 
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internal TATA-boxes, chi-sites and ribosomal entry sites, AT-rich or GC-rich sequence 
stretches, repeat sequences and RNA secondary structures, and (cryptic) splice donor and 
acceptor sites, branch points 

5 No reverse-complementary sequence identities longer than 20 nucleotides are found when 
the optimized sequence is aligned to the transcriptom of Homo sapiens. No RNA 
interference should therefore be expected. The entire gene was as two fragments of 4641- 
{Kpnl/Xhol cloning sites,) and 4838-bp (XhoI/SacI cloning sites) fragment and cloned into 
pUC-Kana and pCR-Script-Amp (Stratagene, CA, USA), respectively. The two subfragments 
10 were subcloned as one fragment of 9473-bp into pUOKana (KpnI/SacI cloning sitesj. 

The recodontsed VAR4 exonl sequence is listed as SEQ ID NO.: 7. 

The recodonised PFD1235w/MAL7Pl.l (SEQ ID NO.: 7) domains: NTS, 

15 DBLl-a, CIDRl-a, CIDRl-y, DBL2-(3, DBL2-y, C2, 

DBL3-P, DBL3-y, DBL4-y, DBL4-5, DBL4-p, DBL5-5, DBL5-p, and CIDR2p were cloned into the 
Baculovirus transfer vector pAcGP67 (BD Biosciences, Br0ndby, Denmark) for expression 
of protein in insect cells. The same domains were cloned into the pDisplay and pRE4 
vectors for surface expression on T-, CHO- and COS cells and the Eucaryotic TA expression 

20 vector pCR3.1 (Invitrogen) for DNA immunizations using the following primers: 

pAcGP67 primers: 



NTS.Fw-opt: 5 '-GAATTCATGGGCAACGCCTCCTCC-3 ' 
NTS . Rv-opt : 5 ' -GTTCCTGTCCTCGG CGTT- 3 ' 
DBLl-a.Fw-opt: 5 ' -GAATTCAACGCCGAGG ACAGGAACC-3 ' 
DBLl-a.Rv-opt: 5 ' -CTTGCTGGTGTAGGTCTG-3 ' 
CIDRl-a.Fw-opt: 5 ' -GAATTCGACGCTAAGACCGACTCCA-3 ' 
CIDRl-a.Rv-opt: 5' - GATGTCGCCGTTCAGGGT-3 ' 
DBL2-p.Fw-opt: 5 ' -GAATTCACCCTGAACGGCGACATC-3 ' 
DBL2-(3.Rv-opt: 5 'GTAGATGGCGGACACGGT-3 ' 
C2. Fw-opt: 5 ' -GAATTCACCGTGTCCGCCATCTAC-3 ' 

C2. Rv-opt: 5 '-GTAGCAGGGCTGCTGGGA-3 ' 

DBL3-p.Fw-opt: 5 ' -GAATTCTCCCAGCAGCCCTGCTAC-3 ' 



DBL3-p.. Rv-opt: 5 GTCGATGCCGCCGGTGGT-3 ' 
DBL4-y.Fw-opt: 5 ' -GAATTCACCACCGGCGGCATCGAC-3 ' 
DBL4-y.Rv-opt: 5 '-GCAGGCGGCGTTCAGGGA-3 ' 
DBL5-5.Fw-opt: 5 ' -GAATTC TCCCTGAACGCCGCCTGC-3 ' 
DBL5-8.Rv-opt: 5 ' -GCGGCAGTGCCTGGCGCA-3 ' 
CIDR2-p.Fw-opt: 5 '-GAATTCTGCGCCAGGCACTGCCGC-3 ' 
CIDR2-p.Rv-opt: 5 ' - CTTTCCG G A AG G C A CGTT- 3 ' 
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pDisplay primers: 

NTS.Fw-opt: 5 ' -TCCCCCGGGATGGGCAACGCCTCCTCC-3 ' 
NTS. Rv-opt : 5 ' -TCCCCGCGGGTTCCTGTCCTCGGCGTT-3 ' 
DBLl-a.Fw-opt: 5 ' -TCCCCCGGGAACGCCGAGGACAGGAAC-3 ' 
DBLl-a. Rv-opt: 5 ' -TCCCCGCGGCTTGCTGGTGTAGGTCTG-3 ' 
CIDRl-a. Fw-opt: 5 ' -CCCGGGGACGCTAAGACCGACTCC-3 ' 
CIDRl-a. Rv-opt: 5 '-TCCCCGCGGGATGTCGCCGTTCAGGGT-3 ' 
DBL2-p.Fw~opt: 5 CCCGGGCTGAACGGCGACATCTGC-3 ' 
DBL2-p.Rv-opt: 5 ' -TCCCCGCGGGTAGATGGCGGACACGGT-3 ' 
C2.Fw-opt: 5 ' -CCCGGGACCGTGTCCGCCATCTAC-3 ' 

C2. Rv-opt: 5 '-TCCCCGCGGGTAGCAGGGCTGCTGGGA-3 ' 

DBL3-p.Fw-opt: 5 '-CCCGGGTCCCAGCAGCCCTGCTAC-3 ' 
DBL3-p..Rv-opt: 5 ' -TCCCCGCGGGTCGATGCCGCCGGTGGT-3 ' 
DBL4~y.Fw-opt: 5 ' - CCCGGGACCACCGGCGGCATCGAC-3 ' 
DBL4-y.Rv-opt: 5 '-TCCCCGCGGGCAGGCGGCGTTCAGGGA-3 ' 
DBL5-5.Fw-opt: 5 ' -CCCGGGTCCCTGAACGCCGCCTGC-3 ' 
DBL5-5.Rv-opt: 5 ' -TCCCCGCGGGCGGCAGTGCCTGGCGCA-3 ' 
CIDR2-p.Fw-opt: 5 ' -CCCGGGTGCGCCAGGCACTGCCGC-3 ' 
CIDR2-p.Rv-opt: 5 '-TCCCCGCGGCTTTCCGGAAGGCACGTT-3 ' 

pRE4 primers: 

NTS.Fw-opt: 5 GACCAGCTGATGGGCAACGCCTCCTCC-3 ' 
NTS. Rv-opt: 5 ' -ACCGGGCCCGTTCCTGTCCTCGGCGTT-3 ' 
DBLl-a.Fw-opt: 5 ' -GACCAGCTGAACGCCGAGGACAGGAAC-3 
DBLl-a. Rv-opt: 5 '-ACCGGGCCCCTTGCTGGTGTAGGTCTG-3 ' 
CIDRl-a. Fw-opt: 5 '-GACCAGCTGGACGCTAAGACCGACTCC-3 ' 
CIDRl-a. Rv-opt: 5 '-ACCGGGCCCGATGTCGCCGTTCAGGGT-3 ' 
DBL2-p.Fw~opt: 5 ' -GACCAGCTGCTGAACGGCGACATCTGC-3 ' 
DBL2-p.Rv-opt: 5 '-ACCGGGCCCGTAGATGGCGGACACGGT-3 ' 
C2. Fw-opt: 5 ' -GACCAGCTGACCGTGTCCGCCATCTAC-3 ' 

C2. Rv-opt: 5 ' -ACCGGGCCCGTAGCAGGGCTGCTGGGA-3 ' 

DBL3-p.Fw-opt: 5 ' -GACCAGCTGTCCCAGCAGCCCTGCTAC-3 ' 
DBL3-p..Rv-opt: 5 ' -ACCGGGCCCGTCGATGCCGCCGGTGGT-3 ' 
DBL4-y.Fw-opt: 5 '-GACCAGCTGACCACCGGCGGCATCGAC-3 ' 
DBL4-y.Rv-opt: 5 ' -ACCGGGCCCGCAGGCGGCGTTCAGGGA-3 ' 
DBL5-5.Fw-opt: 5 ' -GACCAGCTGTCCCTGAACGCCGCCTGC-3 ' 
DBL5-5.Rv-opt: 5 ' ACCGGGCCCGCGGCAGTGCCTGGCGCA-3 ' 
CIDR2-p.Fw-opt: 5 ' -GACCAGCTGTGCGCCAGGCACTGCCGC-3 ' 
CIDR2-p.Rv-opt: 5 '-ACCGGGCCCCTTTCCGGAAGGCACGTT-3 ' 

5 



pCR3.1 primers: 
NTS.Fw-opt: 
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5 '-GCCRCCATGGGCAACGCCTCCTCC-3 ' 
NTS. Rv-opt : 5 ' -CTAGTTCCTGTCCTCGGCGTT-3 ' 
DBLl-cc.Fw-opt: 5 ' -GCCRCCATGAACGCCGAGGACAGGAACC-3 
DBLl-a.Rv-opt: 5 ' - CTACTTGCTG GTGTAGGTCTG - 3 ' 
CIDRl-a.Fw-opt: 5 ' -GCCRCCATGGACGCTAAGACCGACTCCA-3 ' 
CIDRl-a.Rv-opt: 5'- CTAGATGTCGCCGTTCAGGGT-3 ' 
DBL2-p.Fw-opt: 5 ' -GCCRCCATG ACCCTGAACGGCGACATC-3 ' 
DBL2~p.Rv-opt: 5 ' CTAGTAGATGGCGGACACGGT-3 ' 
C2.Fw-opt: 5 ' - G CC RCC ATG ACCGTGTCCG CCATCTAO 3 ' 

C2. Rv-opt: 5 '-CTAGTAGCAGGGCTGCTGGGA-3 ' 

DBL3-p.Fw-opt: 5 ' -GCCRCCATGTCCCAGCAGCCCTGCTAC-3 ' 



DBL3-p.. Rv-opt: 5 '- CTAGTCG ATG CCGCCGGTG GT-3 ' 
DBL4-y.Fw-opt: 5 ' -GCCRCCATGACCACCGGCGGCATCGAC-3 ' 
DBL4-y.Rv-opt: 5 ' -CTAGCAGGCGGCGTTCAGGGA-3 ' 
DBL5-5.Fw-opt: 5 ' -GCCRCCATGTCCCTGAACGCCGCCTGC-3 ' 
DBL5-6.Rv-opt: 5 '-CTAGCGGCAGTGCCTGGCGCA-3 ' 
CIDR2-p.Fw-opt: 5 '-GCCRCCATGTGCGCCAGGCACTGCCGC-3 ' 
CIDR2-p.Rv-opt: 5 ' -CTACTTTCCGGAAGGCACGTT-3 ' 



Example 18: 

Field isolates of P. falciparum causing severe malaria shows high transcription of 
5 PFD1235W/MAL7P1.1 (SEQ ID NO. 1), PF11_0008 (SEQ ID NO.: 3), and/or PF13_0003 
(SEQ ID NO.: 5) or homologues hereof. 

From Example 3, 4, 5, 6 and 7 it appears that the Group A PFD1 235w/MAL7Pl . 1 (SEQ ID 
NO. 1), PF11_0008 (SEQ ID NO.: 3), and/or PF13_0003 (SEQ ID 5) gene encodes VSA SM - 
10 like proteins, and that these proteins are involved in the acquisition of the VSA SM 

phenotype following antibody-selection of VSA UM -expressing 3D7 (3D7 SM ) and adhesion to 
endothelial cells (3D7 en do)- 



Materials and methods 

15 ml8.1. Real-time PGR was done using a Rotorgene thermal cycler system 

( http://www.corbettresearch.com ) following purification of RNA and synthesis of 
cDNA (m5.1). The transcription level of PFD1235w/MAL7Pl.l (SEQ ID NO. 1), 
PF11_0008 (SEQ ID NO.: 3), and PF13_0003 (SEQ ID NO.: 5) or homologues of 
hereof were measured using primers targeting this sequence. The absolute copy 

20 number was calculated (m6.3) and compared to the absolute copy number of other 

var genes present in the field isolates as well as housekeeping genes such as seryl- 
tRNA synthetase. Reactions were performed as in (m6.2) 
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The transcription of PFD1235w/MAL7Pl.l (SEQ ID NO. 1), PF11_0008 (SEQ ID NO.: 3), 
and/or PF13_0003 (SEQ ID NO.: 5) or homofogues hereof were found to be higher than 
other var genes in parasites causing severe malaria, but iow in parasites causing 
5 uncompiicated malaria (data not shown). 

Example 19: 

Plasmodium falciparum field isolates causing severe malaria expresses VAR4 (SEQ ID NO.: 
2) on the surface of infected erythrocytes 

10 

Analysis of VAR4 (SEQ ID NO.: 2) expression of RBC infected with parasites causing severe 
malaria using antibodies to Bacuiovirus-derived DBL5-5 of the PFD1235w/MAL7Pl.l gene 
product revealed a high-molecular weight band of approximately 400 kDA corresponding to 
the size of the VAR4 expressed by 3D7 SM (Example 6), which was absent in parasites with 
15 a VSA UM phenotype (data not shown). 

Taken together, the results of Example 18 and 19 indicates that PFD1235w/MAL7Pl . 1 
(SEQ ID NO.: 1) or homologues hereof are not only transcribed, but also expressed on the 
surface of RBC infected with parasites having a VSA SM phenotype and causing severe 
20 malaria, but not parasites causing uncomplicated malaria and having a VSA VM phenotype. 
Similar results were obtained for PF11_0008 (SEQ ID NO.: 3), and PF13_0003 (SEQ ID 
NO.: 5) (data not shown). 

Example 20: 

25 Plasmodium falciparum field isolates causing severe malaria adhere to endothelial cells and 
adhesion is inhibited by SEQ ID NO. 2, 4, and/or 6 specific antibodies 

To show that VAR4, VAR5, and/or VAR6 are responsible for in vitro adhesion of field 
isolates to endothelial receptors, an antibody adhesion assay with murine antibodies 
30 against VAR4, VAR5, and VAR6 were performed. 

Materials and methods: 

m20-l. Antiadhesion was measured by 3 H labeled parasites: For use in adhesion assays, 
parasite cultures with a parasitemia of ^1% late trophozoites and schizonts were 

35 first transferred from Albumax II medium (Life Technologies), with a high 

concentration of hypoxanthine (Hpx), into RPMI 1640 plus 5% normal human 
serum (iow Hpx) and maintained for 24 h. The parasites then were labeled by 
exposure to [3H]Hpx (Amersham; 8.75 MBq/mL of RBCs) for another 24 h. Finally, 
the cultures were enriched for late-stage iRBCs and incubated for 30 min, with or 

40 without test plasma. Endothelial cells were grown on Microtiter plates (Falcon; 

Becton Dickinson) then blocked with bovine serum albumin (BSA; 20 mg/mL, 100 
Hl_/well) in PBS at room temperature for 30 min. We added enriched [ 3 H]Hpx- 
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labeled late-stage IRBCs to wells containing endothelial cells (2 x 10 6 cells/well) 
and incubated the wells at 37°C for 1 h. Nonadherent iRBCs were removed by 4 
washes in RPMI 1640. Adherent iRBCs were harvested onto glass fiber pads, and 
the [ 3 H]Hpx activity was measured in a liquid scintillation counter (Beckman 
5 Coulter). Inhibition of iRBC adhesion by plasma was calculated as 1 - 

(testEndothelialCell - controlBSA)/controlEndothelialCell - controlBSA), where 
testEndothelial is counts per minute of iRBCs preincubated with plasma and 
adhering to wells containing endothelial cells, and controlEnothelial cells and 
controlBSA refer to counts per minute of iRBCs not preincubated with plasma and 
10 adhering to endothelial cells and BSA-coated wells, respectively. 

Cytoadhesion to endothelial cells were significantly inhibited by plasma from young semi- 
immune children, and more importantly binding of field isolates to endothelial cells were 
strongly inhibited by the murine anti-VAR4, -VAR5, and -VAR6 antibodies. In this example 
15 it is shown that antibodies raised against recombinant VAR4, -VAR5, and -VAR6 inhibit 
field parasite isolate adhesion to endothelial cells in vitro. 

An obvious consequence of this finding is that vaccine induced antibodies against SEQ ID 
NO.: 2, 4, and/ or 6 constructs can hinder binding of parasites to endotheiiai tissue and 
20 thus prevent severe maiaria. 



Example 21: 

A novei DynaBead based method and adhesion to endotheiiai ceiis seiects for RBC infected 
with parasites having a VSA S m phenotype and which transcribes var genes with simiiar 
25 characteristics 

Selection of parasites using a novel antibody-DynaBead based method (Example 3) and 
adhesion to endothelial cells (Example 7) leads to changes in the serological phenotype of 
the 3D7 parasites from being of a VSA UM - to a VSA SM -type (Example 3 and Figs. 5C-D). 
30 This VSA SM phenotype resembles that of parasites causing severe malaria (Example 1 and 
Fig. 1). Thus, this method and results provided by it forms the basis for identification of 
PfEMPl molecules that could be used as part of a vaccine against severe malaria. 

Using this method we were able to identify three different var genes PFD1235w/MAL7Pl.l 
35 (SEQ ID NO.: 1), PF11_0008 (SEQ ID NO.: 3), and PF13_0003 (SEQ ID NO.: 5) that were 
highly transcribed and upregulated in 3D7 SM (Figs. 13, 14, and 20, Example 6 and 7, Table 
2). The protein products of these three genes are responsible for the serological phenotype 
and adhesive properties of 3D7 SM . Interestingly, these three genes do not belong to the 
varl, var2 gene subfamily as defined previously (Salanti et al. 2002, Salanti et al. 2003) or 
40 Group B, Group C, and Group B/C var genes as defined recently (Lavstsen et al. 2003). 
They lack 1-2 cysteine residues in DBLa homology group G (Smith et al. 2000) compared 
to most PfEMPl molecules in Group B, C, and B/C. Additionally, they posses a CIDRla 
distinctly different from the CIDRla domain of the MCvarl PfEMPl gene product expressed 
by Malayan Camp parasites and of the A4VAR expressed by A4 parasites that have been 
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shown to bind CD36 (Baruch et al. 1997; Cooke et al. 1998; Bryan et al. 2003; Smith et 
al. 1998). By contrast, 3D7 SM that transcribes PFD1235w/MAL7Pl.l (SEQ ID NO.: 1), 
PF11_0008 (SEQ ID NO.: 3), and/or PF13_0003 (SEQ ID NO.: 5) do not bind to CD36 
(Example 4 and 5). 

5 

In conclusion, the three genes are present and expressed in field isolates causing severe 
malaria, but not in parasites with a VSA UM phenotype (Example 18, 19, and 20) indicating 
that the serve similar functions and might be responsible for the pathogenesis of severe 
malaria. 

10 

Example 22: 

In this example we show that CD36-selection of a 3D7 SM initially expressing VAR4 on its 
surface leads to loss of VAR4 surface expression and confirm that 3D7 SM expressing VAR4 
do not bind to CD36. 

15 

m22.1 A freshly antibody selected 3D7 SM (Example 3) expressing VAR4 on the surface was 
selected for binding to CHO cells expressing human CD36. Binding to CHO-CD36 was done 
as monitored as in Example 4. The expression of VAR4 on the surface of CD36 selected 
3D7sMwas analysed by FACS and Western blotting as described in Example 11 using 
20 antiserum against DBL55 and CIDRla of VAR4 (SEQ ID NO.: 2). 

The majority of freshly antibody selected RBC infected by 3D7 SM showed surface 
expression of VAR4 (Fig. 21, Panel A). Three rounds of selection of 3D7 SM for binding to 
human CD36 lead to reduced expression of VAR4 (Fig. 21, Panel C) with a complete loss of 
25 VAR4 expression following the fourth selection of 3D7 SM to CD36 (Fig. 21 Panel D). The loss 
of VAR4 expression remained stable following the fifth and sevenths selection for binding 
to CD36 (Fig. 21 Panel E and F). 

A majority of RBC infected with 3D7 not expressing VAR4 on the surface bound to CD36 
30 (Fig. 22, Panel A & C) as compared to 3D7 SM expressing VAR4 on the surface (Fig. 22, 
Panel B & D). 

In conclusion, these results indicate that parasites expressing the VAR4 (VSA S m) phenotype 
do not adhere to CD36, whereas parasites that do not express VAR4 do bind CD36. 

35 

Example 23: 

From an in vitro study (Roberts et al. 1992) of P. facliparum it has been estimated that 
antigenic variation result in 2% of parasite population switching away from the original 
40 antigenic type per generation. PfEMPl is believed to be the major contributor to this 
antigenic variation. To look at the stability of the transcription and expression of 
var4/VAR4 (SEQ ID NO.: 1 and 2) 3D7 parasites with a VSA S m was left in culture for 60 
generations. 
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m23-l. A 3D7 UM and freshly selected 3D7 SM (Example 3, m3-l) was left In culture to drift 
for 60 generations. Both the 3D7 UM and the 3D7 sM-drifter was late-staged enriched by 
gelatine flotation at regular intervals. FACS analysis using antiserum against DBL55 
and CIDRlct of VAR4 (SEQ ID NO. : 2) and analysis of var gene transcription was 
5 done as described in Example 6. 

m23~2. The absolute copy number of each var gene was calculated based on the 

standard curves described in Example 6. The total number of var gene copies was 
was calculated by summing up the absolute copy number of each var gene minus 
the abosolute number of PFI1830c and PFE1640W (varl) transcripts. PFI1830c and 

10 PFE1640w has previously been shown to be transcribed at higher levels in 

trophozoite/schizont-stage parasites in contrast to all other var genes, which are 
transcribed at higher levels in ringstage parasites. The contribution of each var 
gene (excluding PFI1830c and PFE1640w) to the var gene transcription was 
calculated in percentage ([Absolute copy number of "specific var gene"]/[total var 

15 gene copy number*100%]). 

This analysis showed that surface expression of VAR4 remains stable for at least 44 
generations (Fig. 23 Panel A-E). At the start of the experiment the var4 gene was found to 
be the major transcript in the freshly selected 3D7 SM constituting 19.6% to the total var 
20 gene transcription. Following 28 generations var4 constituted 15.6% to the total var gene 
transcription. This percentage remained constant for another 16 generations and then 
started to decrease. Following 60 generations the var4 gene transcription constituted 4.7% 
to the total var gene reportoire, which was reflected in a decrease surface expression of 
VAR4 (Fig. 23 Panel A). 

25 

Example 24: 

To test whether high levels of antibodies to VAR4 (SEQ ID NO.: 2) is associated with 
protection from malaria (e.g. reduced risk of developing febrile malaria, reduced risk of 
developing anemia) we looked at the prevalence and levels of anti-CIDRla VAR4 
30 antibodies by ELISA as described in Example 13. 

m24-l. Plasma samples from 569 individuals aged 0 to 59 years from two different 

Tanzanian villages Kwamasimba (moderate malaria transmission) and Mkokola 
(high malaria transmission) were tested in a VAR4 CIDRla ELISA as described in 
35 Example 13. Logistic regression models controlling for age, sex, village of 

residence, net use and parasite density were done using the STATA 8.2 Programme 
(Stata Corporation, Texas, USA; http://www.stata.com). 

This analysis showed (Table 3A) a decreased risk of developing anemia for individuals 
40 having higher levels of anti-CIDRla VAR4 antibodies (Odds ratio 0.34, 95% Confidence 
interval 0.23-0.5). In children below 5-years of age, the model (Table 3B) showed a 
reduced risk of developing febrile malaria with high levels of anti-CIDRla VAR4 antibodies 
(Odds ratio 0.28, 95% Confidence interval 0.10-0.78). 
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These results indicate that the presence of VAR4 antibodies is associated with protection 
from malaria morbidity. Thus, these results strengthen the vaccine candidacy of VAR4. 

Table 3 

5 



A 1 



Anaemia 


Odds Ratio 


Std. Err. 


z 


P>z 


[95% Conf. Interval] 




Presence of anti-CIDRlct 


0.34 


0.07 


-5.35 


0.00 


0.23 


0.50 




Age (years) 


0.95 


0.01 


-6.37 


0.00 


0.94 


0.97 




sex 


0.69 


0.13 


-1.98 


0.05 


0.48 


1.00 




Village 


3.01 


0.62 


5.36 


0.00 


2.01 


4.50 




Use of net 


0.90 


0.18 


-0.51 


0.61 


0.61 


1.33 




1 Logistic regression including 569 individuals aged 0-59 years of age 
B 2 








FEBMALA 


Odds Ratio 


Std. Err. 


z 


P>z 


[95% Conf. Interval] 


Presence of anti-CIDRla 


0.28 


0.15 


-2.45 


0.01 


0.10 


0.78 


Age (years) 


1.27 


0.18 


1.73 


0.08 


0.97 


1.68 


Sex 


0.66 


0.22 


-1.24 


0.22 


0.34 


1.28 


Village 


3.84 


1.46 


3.53 


0.00 


1.82 


8.10 


Use of net 


0.71 


0.25 


-0.98 


0.33 


0.35 


1.42 



10 2 Logistic regression including 216 individuals less than 5 years of age 
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